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Abstract 

This work is divided into two main parts. The first part is devoted to exploring the con- 
nectivity of random subgraphs of cartesian products of K\, K2, and P3. In the second part, 
the author presents a short review of the results about network reliability. 

The cartesian product of K2, the complete graph with 2 vertices, is the cube graph Q n . A 
random subgraph of Q n , Qp n , contains all vertices of Q n , and each edge of Q n independently 
with probability p n . One can call p n the percolation parameter. The author explains in detail 
that for p n > 1 — (l/2)(logn) 1 /™, Q™ n has no components with size larger than 1 and smaller 
than 2", as n — s- 00. It is also explained that for p n = 1 — (1/2) A 1 /™ (1 +o(l/n)), the probability 
that Qp n has no isolated point, as n — > 00, tends to e _A ; hence, the probability that Q™ n is 
connected tends to e~ A . For constant percolation values larger than 1/2, when n tends to infin- 
ity, almost every random subgraph of Q n is connected; for percolation values smaller than 1/2, 
when n tends to infinity, almost no random subgraph of Q n is connected; and for percolation 
values equal 1/2, when n tends to infinity, the probability that Q"/ 2 * s connected tends to e _1 . 
At the end of this section, a comparison between connectivity of a typical random graph with 
M edges and N vertices, G G G(N = 2", M = n2 Tl_1 ), and Q n , after percolation with the 
parameter p n is presented. 

This work continues with exploring the threshold function for the cartesian product of 
K3, the complete graph with 3 vertices, denoted by 3 Q n . It is shown that for p n > 1 — 
(l/v / 3)(logn) 1 /™, 3 Qp n has no components with size larger than 1 and smaller than 3", as 
n — > 00. Then, it is proved that for p n = 1 — (l/v / 3)A 1 / 2,l (l + o(l/n)), the probability that 
3 Qp n has no isolated point, as n — > 00, tends to e _A ; hence, the probability that is con- 
nected tends to e~ A . At last, the author suggests that the threshold value for connectivity of 
the cartesian product of P3, where P3 is a path with length 2, denoted by P3 1 , is 2 — \/2. One 
can show that for percolation values smaller than 2 — y/2, almost no random subgraph of P3" 
is connected, and for percolation values larger than 2 — v2, almost every random subgraph of 
P£ has no isolated point. The author also shows that for percolation values larger than 0.68 
almost all random subgraphs of P£ are connected. 

The last part of this work, sheds light on reliability of networks. The main question in this 
part is: one is given 2 parameters, n and m where n and m are positive integers. Among all 
graphs with n vertices and m edges, which graph G, if any, maximizes the probability that when 
one docs percolation on G with the parameter p n , for all p n in (0, 1) there is one component? G 
would be called the uniformly optimally reliable graph (UOR graph) for the parameter n and 
to. It is shown in this part, for some to and n there is no UOR graph, since the graph which 
maximizes the probability of connectivity depends on p n in that family of graphs. A review of 
results about when the UOR graph exists is presented in this part. 

Keywords: Random subgraphs, percolation, n-cube, path graph, reliable networks. 
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Chapter 1 



Introduction 



Exploring the connectivity of random subgraphs of different families of graphs is one of the most 
interesting topics in random graphs and percolation theory. A random subgraph of a graph 
G(V n , E m ) is a graph which contains all vertices of G, and each edge of G independently with 
probability p n . p n is known as the percolation parameter. Connectivity of a random subgraph 
of a graph can be investigated both for small and considerably large (when n tends to infinity) 
graphs. For considerably large graphs, the first step in exploring the connectivity is to calculate 
p c which for all constant p = p n ,p € (0, 1) and p < p c , as n tends to infinity, almost all random 
subgraphs of G(V n ,E m ) is connected; but for all p £ (0, 1) and p > p c , as n tends to infinity, 
almost no random subgraphs of G(V n ,E m ) is connected. The second step is to investigate what 
happens when p = p c . A more complete approach is to calculate p c when p c depends on n. 

For small graphs, it is of interest to find the uniformly optimally reliable graph (UOR graph). 
Consider G(n, m) as the family of graphs with n vertices and m edges. The UOR graph is the 
graph G £ G{n, m) that maximizes the probability that G is connected after percolation with 
the parameter p n for fixed n, m and all p n € (0, 1). 

One of the interesting graphs for analyzing the connectivity of its random subgraphs is 
the cube graph. The cube graph Q n , is a graph with the vertices labeling 1, 2, 3, 4, 2™ — 1. 
Two vertices in this graph are adjacent if their binary representation differs only in one digit. 
Another way to define Q n is using the cartesian products of n copies of K2, where K n is the 
complete graph with n vertices. It is shown by Paul Erdos and Joel Spencer [3] that p c = 1/2 
for Q n . An extension of Q n is the graph with the vertices labeling 1, 2, 3, 4, 3" — 1, where 
two vertices in this graph are adjacent if their ternary representation differs only in one digit. 
We call this graph 3-cube denoted by 3 Q n . One can show that 3 Q n is the cartesian product of 
K3. Connectivity of random subgraphs of the cartesian product of Ki is investigated by Lane 
Clark [TS] . Another extension of Q n is the cartesian product of n copies of a path with length 
2 which we call it PJ\ 

This thesis is divided into two main parts. The first part (chapter 4,5,6) is devoted to the 
connectivity of random subgraphs of some considerably large graphs, and the second part is the 
connectivity of random subgraphs of small graphs (chapter 7) . Chapter 2 presents a very short 
review of definition and results in graph theory. Chapter 3 is a short review of the definitions 
in random graph theory; it is explained briefly in this chapter that how small components 
construct a giant component and gradually a graph becomes connected by adding more edges 
to it. In chapter 4, the results by Bela Bollobas [T] on finding p c for connectivity of random 
subgraphs of Q n is explained in detail. In chapter 5 the author calculates p c for connectivity of 
random subgraphs of 3 Q n . After calculating the threshold value for the connectivity of random 
subgraphs of 3 Q n , the author found that this problem is solved for a general case of the random 
subgraphs of cartesian product of Ki [TS] . Chapter 6 is an approach to find p c for connectivity 
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of random subgraphs of P£ . This work finishes with chapter 7 which is a review of the results 
on finding the UOR graph. In this chapter it is shown by the author that for some m and n 
there arc no UOR graph. 



2 



Chapter 2 



Graph theory background 



This chapter presents s short review of basic definitions in graph theory. Most of the definitions 
in this chapter are extracted from |10j . 

2. 1 Graph models and their matrix representation 



Definition 1. Graph: A graph G(V,E) is an ordered pair consisting of the set of vertices V, 
and the set of edges E. Each edge is associated with a set of vertices which are called endpoints. 
Two vertices are adjacent if they are joined by an edge. Two edges are adjacent if they have a 
common endpoint. A vertex is incident to an edge and viceversa, if that vertex is an endpoint 
of the edge. A self-loop is an edge which joins a vertex to itself. A multi-edge is a set of two 
or more edges having the same endpoints. A simple graph is a graph without self-loops and 
multi-edges. 



After defining a graph, it is of interest to get familiar with the characteristics of different graphs 
in order to compare them. One of the basic characteristics is the degree of each vertices. 

Definition 2. Degree: The degree of a vertex, denoted by deg(v), is the number of edges 
incident on that vertex plus two times the number of its self-loops. The smallest degree in a 
graph is denoted by S min or 5, and the largest degree in a graph is denoted by S max or A. The 
degree sequence of a graph is the non-increasing sequence of vertice degrees. 

The first question that comes into mind, after defining the degree sequence of a graph, is if 
there exists a degree sequence of a graph for each sequence of positive integers. 

Definition 3. Graphic: A sequence of positive integers is graphic if there is a permutation of 
it that is the degree sequence of a simple graph. An explicit sufficient and necessary condition 
for a sequence of positive integers to be graphical is: 

Theorem 1. A sequence of non- negative integers (di, efe, •••) d n ) is graphical if and only if 



Graphs 



Degrees 



k 



n 




j=k+l 



(2.1) 



for each 1 < k < n |S]. 
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Graph models 

There are many types of graphs. Some of the most important types of simple graphs are: 

Definition 4. Common families of graphs: A complete graph K n is a simple graph with 
n vertices which every pair of vertices is connected by an edge. A bipartite graph G is a graph 
with the set of vertices that can be partitioned into two subsets U and W, such that each edge 
in G has one endpoint in U and one endpoint in W. A regular graph is a graph where all 
vertices have the same degree. A path graph is a simple graph with |V| = \E\ + 1 that can be 
drawn, such that all vertices and edges are in a single straight line. A path graph with |V| = n 
and \E\ = n — 1 is denoted by P n . A hypercube graph (cube graph) is a simple n-regular graph 
with the set of vertices labels from to 2™ — 1, in which two vertices are adjacent if their binary 
representation differs only in one digit. 

Definition 5. Subgraphs: H subgraph of G is a graph whose vertices and edges are in G. If 
Vq = Vh then the subgraph H is said to span the graph G. The induced subgraph on U C Vq 
of G is the graph whose set of vertices is U, and set of edges is all edges of G with two endpoints 
in U . The induced subgraph on D C Eq of G is the graph whose its set of edges is D, and 
its set of vertices is all vertices which are incident with an edge in D. A maximal connected 
subgraph of a graph G is a component of G. 

Definition 6. Cartesian product of a graph: G x H, the Cartesian product of G and H is 
the graph with the set of vertices Vq x Vh and the set of edges (Vg x Eh) U (Eg x Vh)- 

Defining a walk on a graph can help us to define some important characteristic of a graph 
such as connectivity of a graph and spanning trees. 

Definition 7. Walk: In a graph G, a walk from vertex vq to vertex v n is an ordered sequence 

W =< v ,e 1 ,v 1 ,e 2 , ...,v n -i,e n ,v n > (2.2) 

of vertices and edges, such that the endpoints of is {vi—i,Vi} for i = 1, For a simple 

graph one can abbreviate the representation as a vertex sequence 

W=< v ,v 1 ,...,v n > (2.3) 

Definition 8. Tree, spanning tree: A path is a walk with no repeated vertices (except the 
initial and final vertices). A cycle is a nontrivial closed path. A tree is a connected graph 
without cycle. A spanning tree of a graph is a subgraph of a graph which is a tree. 

Matrix representations 

The last important concept in this section is that each graph can be presented as a matrix, as 
follows: 

Definition 9. Matrix representation of a graph: The adjacency matrix of a simple graph 

„ . . r , f 1, if it and v are adjacent; r . r . . T _ 

G is: Ag\u,v\ = < . ,. for all pairs of vertices u and v m Vg- 

L ' [0, otherwise. 

2.2 Connectivity 

Connectivity of a graph is one of the most important property of a graph. A graph is connected 
if for every pair of vertices u and v, there is walk from u to v. There arc different types of 
connectivity for a graph: 
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Definition 10. Vertex-connectivity: k v (G), vertex connectivity of a connected graph G, is 
the minimum number of vertices which its removal will disconnect G or reduce it to a single 
vertex graph. A graph G is k-connected if k v (G) > k. 

Definition 11. Edge-connectivity:re e (G), edge connectivity of a connected graph G, is the 
minimum number of edges which its removal will disconnect G. A graph G is k- edge- connected 
if K e (G) > k 

Definition 12. Algebraic connectivity: The Laplacian matrix of a graph is L := D — A, 
where A is the adjacency matrix and the D is the diagonal matrix of vertex outdegrees. The 
algebraic connectivity of an undirected graph with the Laplacian matrix L is the second smallest 
eigenvalue of L. If one arranges the eigenvalues of L as : Xi(L) < \2(L) < ...X n (L), then \2(L) 
is the algebraic connectivity of a graph. The following theorem presents some applications of 
algebraic connectivity: 

Theorem 2. For an undirected graph with minimum vertex degree S and maximum vertex 
degree A, we have: 

• A2 > with the inequality strict if and only if the graph is connected. 

• A 2 < -M < -2-rA < An- 

a — n—l — n— 1 — n 



Chapter 3 



Random graph theory 
background 



A random graph is a graph with a specific number of vertices which adjacency between two 
vertices are determined in a random way jlOj . In this chapter, we define Erdos Renyi random 
graph, and then we explain briefly some properties of it. 

3.1 Evolution of Erdos Renyi graphs 

G(n, M),0 < M < (2), is the equiprobable space of all simple graphs with the vertex set 
V = {1, 2, n} and M edges. G(n,p), < p < 1, is the collection of all graphs with the vertex 
set V = {1, 2, n} in which two vertices are connected independently with the probability p 

[ID], p. 

It is of interest to study the global structure of a random graph of order n (with n vertices) 
and size M(n) (with M(n) edges). Let us define Lj(G) as the order of the jth largest component 
of a graph G, where if G has fewer than j components then Lj(G) = 0. Consider the random 
graph process G = (Gt)^L where Gt is getting larger by adding more and more edges. When 
t ~ icn and c < 1 then in a.e Gt the maximum of the order of its components is of order 
logn. When c = 1, in a.e G t Li(G\ n / 2 \) has order n 2 / 3 . When t passes n/2, L\(G) begins 
to grow suddenly and the giant component, which is a component whose order is much larger 
than other components, appears. Eventually, small components join the giant component and 
the graph becomes connected. Erdos Renyi proved that (n/2) logn is the sharp threshold for 
connectedness pQ. The following theorem illustrates this fundamental result: 

Theorem 3. Let c e R be fixed and let M = (n/2){logrc + c + o(l)} e N and p = {logn + c + 
o(l)}/n. Then [J: 

P(Gm is connected) — > e~ e as n — > oo (3.1) 

and 

P(G P is connected) — > e~ e as n — > oo. (3.2) 
For more information regarding random graphs one can check [1], [2], [13], [14] . 
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3.2 Properties of almost every graphs 

A graph property T is true for almost every (all) graph if for fixed p = p(n) [3J: 

lim P(G G G{n,p) and G has property T) = 1 for p > (3.3) 

n— >oo 

Some of the important "almost every graph properties" are: 

Theorem 4. For any integer r > 1 and all p £ (0, 1), almost every graph contains K r [10] . 

Theorem 5. Almost every graph is connected for all p £ (0, 1) |10j . 

Theorem 6. For k £ N and all p £ (0, 1), almost every graph is k-connected [TU] . 

3.3 Probabilistic methods 

Usually, the goal in the probabilistic method is to prove the existence of a combinatorial struc- 
ture with a certain property. The usual approach in these methods is to first construct a suitable 
probability space, then show that there exists a random object in that space with the desired 
properties [TT]. Sometimes it is not easy to find the desired object, instead one proves that there 
is an object which almost satisfies the desired conditions |12j . Usually, it is possible to modify 
the almost close object in a deterministic way so that one gets the desired object. Markov's 
inequality, and Chebyshev inequality are two important inequalities used for this purpose. An 
important concept in probabilistic methods is the definition of threshold function, which is: 

Definition 13. r(n) is called a threshold function for a graph property T for G(n, M(n)) if: 

1. When lirrin^oo = almost every graphs do not satisfy T. 

2. When lim„_ J . 00 = 1 almost every graphs satisfy T. 
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Connected random subgraphs 
of the cube 



s 

The cube graph, Q n , is a graph with 2™ vertices. If one labels each vertex of Q n from to 
2™ — 1, then two vertices are adjacent if their binary representation differs only in one digit. 
Hence, one can conclude that each vertex in Q n is connected to n other vertices. In other 
words, Q n has n2 n ~ 1 edges. A random subgraph of Q n is denoted by Qp n - Qp n contains all 
vertices of Q n , and each edge of Q n independently with probability p n . 

It is of interest in this chapter to explore a critical value p c , which for fixed values of p if 
p < p c then the probability that Q™ n is connected, as n — > oo, tends to 0; but if p > p c then 
the probability that Q p l n is connected, as n — > oo, tends to 1. Burtin proved that this critical 
value is 1/2 [6]. Later, P.Erdos and J.Spencer proved that for p = 1/2 the probability that Q™ n 
is connected, as n — > oo, tends to e" 1 |3j- 

In the first section of this chapter, first the probability that Q™ n has no isolated point as 
n — y oo, for fixed p, is investigated. It is proved that for p < 1/2 the probability that Q™ n has 
no isolated point, as n — > oo, tends to 0. Therefore, for p < 1/2 the probability that Q™ n is 
connected, as n — > oo, tends to 0. Then it is proved that, for p > 1/2 the probability that Q" 
has no isolated point, as n — > oo, tends to 1. In the next step, the probability that Q™^ has no 
isolated point, as n -4 oo, when p depends on n and it is close 1/2, is explored. It is proved 
that for A(n) = A > and p n = 1 - (1/2)A 1 /"(1 + o(l/n)), the probability that Q™ n has no 
isolated point, as n — > oo, tends to e _A [T|. Finally, it is proved that, for fixed p = 1/2 the 
probability that Q™ n has no isolated point, as n — > oo, tends to e _1 . These results are based 
on P.Erdos and J.Spencer's work 0]. 

In the second section, one sheds light on the Isoperimetric problem, which is the problem 
of finding an inequality which relates the size of a subgraph to the size of its boundary. The 
solution to this problem for Q" is presented by S.Hart [5]. One needs such an inequality to 
explore the probability that Q Pn has a component which is not the whole graph. 

In the last section, the Isoperimetric inequality is applied to prove that when p depends 
on n and p n > 1 — (l/2)(logn) 1 / n , then the probability that there are no components with 
size larger than 1 and smaller than 2™ in Q™ n , as n — > oo, tends to 1. Therefore, for p n = 
1 - (l/2)A 1 ' /n (l + o(l/n)), the probability that Q p \ is connected, as n — > oo, tends to e . 
Finally, as a special case, it is shown that, for fixed p if p = 1/2, the probability that Q™ n is 

1 The proof presented in this chapter is based on the proof presented by B.Bollobas in [l] p. 384-393. 
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connected , as n — > oo, tend to e 1 ; and if p > 1/2, the probability that Q™ n is connected, as 
n — > oo, tends to 1. 

4-1 Isolated vertices 
For p < 0.5: 

Assume p is fixed and p < 0.5. First, consider the following definitions: 
Definition 14. f n (p v ):=P{Q^ n is connected) 
Definition 15. g n (p n ) ■= P(Q™ n contains an isolated point) 

t-» e -4-- ic vr ^ / 1 Vertex i is isolated, i e y(Q"J; , , , , 

Definition 16. n := < „ Tr , . . AT „_ . , , , . Pn J r i^r, \ , and! n := > XAn) 
%K ' 1 Vertex i is NOT isolated, i e V^(Q" ). v ; ^ v 



Now, calculate E[X(n)] and yar[X(n,)] as follows: 



H-.= E[X{n)]= J2 E l X i(n)}= £ (1 "iO" = 2"(1 "P) n (4.1) 



7ar[X(n)] = ^ 7ar[Xi(n)] + ^ C<w[X 4 (n), X,-(n)] (4.2) 

where, Var[Xj(n)] and Cov[Xi(n), Xj(n)\ arc equal to: 

£ T/ a r[X l (n)]=2»(l-p)"-2"(l-p)"(l-p) Il =/ J -/ J (l-p)" (4.3) 

*eV(Q£„) 

Ccw[X 4 (n),X,(n)] = ^(n^n)] - £[Xi(n)]i?[Xj(n)] (4.4) 

{0 ij not adjacent; 

(1 -p)"(l -p)"- 1 - (l-p)"(l -p) n = ^r(r^) ij adjacent. 

(4.5) 

and finally: 

Far[X(n)] = M - M (l - p)" + ^(y^) = H + m(1 - P ) B (^ - 1) (4-6) 

Now, since we have Var[X(n)], we can use Chcbyshcv's inequality to estimate g n {p). Cheby- 
shev's inequality states that: 

1 - g n (p) = P[X(n) = 0] < P[\X(n) M | > M ] < Y^Ml (4 . 7 ) 

A* 

By applying Chebyshev's inequality when p < 0.5, one gets Var[X(n)]/ [i 2 — > 0, as n — > oo. 
Therefore lim„_ i . 00 g n {p) = 1. And finally, since f n {p) < 1 — <7n(p)j then for p < 0.5 the proba- 
bility that Q™ is connected for p < 0.5, as n — >• oo, tends to 0. ■ 
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For p > 0.5: 

Assume p is fixed and p > 0.5. In order to calculate g n {p) when p > 0.5, as n- > oo, one can 
use the following inequality: 

g n (p) = P[X(n) > 0] < E[X{n)} = f i (4.8) 

Since E[X(n)] — > as n — > oo, then linin-yoo g n {p) = 0. This means that the probability that 
there are no isolated points in for p > 0.5, as n — > oo, tends to 1. ■ 

For p n = 1 - (l/2)A 1 /«(l + o(l/n)): 

One needs the following theorem from [T] to find the distribution of X(n) (distribution of the 
number of isolated points). 

Theorem 7. Let A = A(ro) be a non-negative bounded function on N. Suppose the non-negative 
integer valued random variables X(1),X(2), ... are such that: 

lim {E r [X{n)\ - A 7 '} =0, r = 0, 1, ... (4.9) 

71— >QO 

where i? r [X] is the rth factorial moment of X, i.e. E r [X] = i£[(X) r ]. Then 

X(n) -A P A (4.10) 

Use the definition of X(n) presented in definition [T51 The goal is to calculate E[X{n)}. 

E r [X{n)\ = E[X(n)(X(n) - l)(X(n) - 2)...{X{n) - r + 1)] (4.11) 
Since X{n) := X)tev(Q" ) anc l -X"»'s are indicator functions, therefore: 

X(n)(X(n)-l)(X(n)-2)...(X(n)-r + l)= ^ X^X,,...^ (4.12) 

where the sum is over all ordered sets of distinct vertices. Then: 

E r [X{n)\ = E[X(n)(X(n) - l)(X(n) - 2)...(X(n) - r + 1)] (4.13) 
= E[ Y, X tl X l2 ...X lr ] (4.14) 

(ii,i 2 ,...,ir) 

= P t X ^ = !> X ^ = X ^ = !] ( 4 - 15 ) 

(»l,»3,...,»r) 

One knows that a set of r vertices is incident with at most rn edges. There are (r) r ( 2 r , ) ways 
to choose such r vertices. Hence: 

E r [X(n)\ > (r) r ^yi-Pn) rn = (2") r (l-2vT (4.16) 

One the other hand, a set of r vertices is incident with at least r{n — r) edges. There are at most 
(r — l) r _i ( r _ 1 ) (r — l)n ways to choose a set of r vertices in Q™ n where at least two vertices are 
adjacent; since if we choose r — 1 vertices independently, then the last vertex must be connected 
to one of the chosen vertices. In other words, there are at most (r — l) r _x( — l)n ways 



to choose r vertices which some of them are adjacent to each other. Hence: 

E r [X(n)} < (2 n ) r (l-p„)™ + (r-l) r _ 1 f _ \{r ~ l)n{\ - p n ) r ^ (4.17) 

< (2™) r (l - p n ) rn + (2™) r rn(l - Pn ) r{n ~^ (4.18) 

< (2 n ) r (l - Pn ) rn + 2 n ^ 1 \n{l - p n ) r{n - r) (4.19) 
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Finally from 14. 161 and l4~Ti?l one gets: 

(2") r (l-p n )™ < E r [X(n)] < (2 n ) r (l-p n y n + 2 n ^rn(l- Pn ) r ^ (4.20) 

(4.21) 

which gives: 

(2(1 - Pn )Y n {\ - L.f < E r [X(n)} < (2(1 - p n )Y n {1 + 2-"rn(l - p n )- r2 } (4.22) 

Since r is fixed and lim„_ i . 00 (2(l — p n )) n = A, then: 

lim (£ r [X(n)]) = A r for r=0,l,2,... (4.23) 

n— too 

This shows that X(n) -^ P A ■ 



For p = 0.5: 

In the calculation of p n = 1 — 1/2A 1 /"(1 + o(l/n)), if we fix p = 1/2 and let A = 1, then we 
get that the distribution of X(n) , as n — > oo, tends to a Poisson distribution with mean 1. 
Therefore, one can conclude: 

lim (1 - g n (p)) = lim (P(X(n) = 0)) = e" 1 (4.24) 

This shows that for p = 1/2 the probability that Q™ has no isolated point, as n — » oo, tends 
to e" 1 . ■ 

^.1? i soperimetric problem for the cube 

One needs an inequality which relates the size of a subgraph of Q n to the size of its boundary. 
This inequality will be applied to prove that for fixed values of p if p > 0.5, then the probability 
that subgraphs of Q n do not have a component of size larger than 2 and smaller than 2™, as 
n — > oo, tends to 1. The proof presented here is based on the proof presented in [T]. 

Definition 17. The edge boundary ba(H), where H is an induced subgraph of G, is the number 
of edges which joins vertices in H to the vertices in G\H. 



Definition 18. ba(m) := mm{ba(H), H is an induced subgraph of G, \V(H)\ = m}. 

The main task in this section is to calculate bQn{m). The answer, loosely, is if m = 2 k for 
some k < n then one should take a k-dimcnsional sub-cube of Q n as 6q™ (H). If 2 k < m < 2 fc+1 , 
for some k < n, then one should choose one side of a (k + 1)— cube and m — 2 k more vertices 
properly chosen in the other half. Since Q n is n-regular and H is an induced subgraph of G 
with \V(H)\ = m, then: 

bQn(H) = mn — 2e(H), where e(H) is the total number of edges in H. (4-25) 

bQn(m) = mn — 2e n (m), where e„(m) = max{e(i7) : H induced subgraph of Q n , |V(i?)| = m}. 

(4.26) 

Definition 19. h(i) := sum of digits in the binary expansion of i and f(l, m) := h(i) 



11 



CHAPTER 4. CONNECTED RANDOM SUBGRAPHS OF THE CUBE 



Lemma 1. If 1 < k < I, then f(l, l + k)> /(O, k) + k 
Proof: 

Let look at the binary expansion of a few numbers: 

Column 3 2 10 

Bin\Dec 2 3 2 2 2 1 2° 



1 1 

2 1 

3 1 1 

4 10 

5 10 1 

6 110 

7 111 

8 10 

9 10 1 

10 10 10 

11 10 11 

From this representation, one can observe that column i starts with a block of 2 % zeros. 
Therefore, sum of jth digits of k consecutive numbers is minimal if the first block of 0's is as 
long as possible. Hence, one can conclude: 

f(l,l + k)>f(0,k) (4.27) 

For every i define r such that < i < 2 r — 1. The binary expansions of i and 2 r — 1 — i arc 
symmetric. This means that, if there is a 1/0 in an specific location of the binary expansion of 
i then there is a 0/1 in the same location of the binary expansion of 2 r — 1 — i. Therefore, 

h(i) + h(2 r - 1 - i) = r for < i < 2 r - 1 (4.28) 

Consequently, since: 

l<i<l+k 2 r -l-k<i<2 r -l 

then: 

f(l, l + k) + f(2 r - I - k, 2 r - I) = rk, if I + k < 2 r (4.30) 

Let us prove lemma Q] with the assumption k < 2 r < I by using incq ualitics 14 . 2 71 and 14.301 This 
assumption means that the length of the sequence in the binary expansion of 2 r + k and 2 r are 
equal. 

With the same logic that one gets 14.271 one gets: 

f(l, l + k)> f(2 r , 2 r + k) when 2 r < I (4.31) 
and then for k < 2 r one can get: 



f(2 r ,2 r + k)= J2 M0 ( 4 - 32 ) 

2 r <i<2 r +k 



12 



CHAPTER 4. CONNECTED RANDOM SUBGRAPHS OF THE CUBE 



X^2 r <i<2 r +fe is the sum over numbers with the same length in their binary expansion's 
sequence. When one removes the last digit in their binary expansion, the remain is /(0,fc). 
Therefore. Y^2 r <i<2 r +k M*) i s equal to k l's plus /(0, k). Hence: 

f{2 r , 2 r + k) = k + / (0, k) when k < 2 r < I (4.33) 

and finally: 

f(l, l + k)> /(0, k) + k where k < 2 r < I (4.34) 

Now, one can prove lemma[T]by induction on K, without the assumption k < 2 r < I. We want 
to prove that for 1 < K < I, f(l + K,l) > K + /(0, K). Fix k such that 1 < k < I and K < k. 
For K = 1 the inequality in lemma [T] is trivial. Assume that the inequality is true for K < k 
and K > 2, which means: 

f(h l + K)>K + /(0, K) when 1 < k < I , and K < k (4.35) 

Now, one should verify the inequality for K = k. Define r > 1 by 2 r_1 < k < 2 r . li I > 2 r , 
then k < 2 r < I and the lemma is implied by inequality 14.341 Hence, one may assume that 
2 r_1 < I < 2 r . Now, one should apply inequality 14.301 and 14.351 in order to get the final result: 

/(/ + k) = f(l, 2 r ) + f(2 r , I + k) (from definition off and I > 2 r ) (4.36) 
= (2 r -Or- f(0,2 r -l) + f(2 r ,l + k) (from\^§ (4.37) 

> (2 r - l)r - /(0, 2 r + /(0, 1 + k - 2 r ) + I + k - 2 r ffromtf35\) (4.38) 

> (2 r - l)r - f(2 r - k, 2 r - k + 2 r - I) + 2 r - I + f(0, 1 + k - 2 r ) + I + k - 2 r (from\J^ 

(4.39) 

> (2 r - l)r - f(2 r - k, 2 r - k + 2 r - I) + /(0, l + k- 2 r ) + k (4.40) 

> f(l + k-2 r ,k) + f(0 1 l + k-2 r ) + k (fromtfWjl (4.41) 

> f(0,k) + k (from characteristics of f) (4.42) 



Theorem 8. For 2 < m < 2" we have 6g»(m) = mn — 2/(0, m). In other words, /(0, m) = 
e„(m) where e n (m) = max{e(iJ) : H induced subgraph of Q n , \ V(H)\ = m}. 

Proof: 

First, let us fix an m. As the first step one should prove that e„(m) > /(0, m). Vertex i is 
connected to h(i) vertices j with j < i, since for each 1 in the binary expansion of i there is 
exactly one j (j < i), which its binary expansion differs in the position of that 1. Therefore, 
one can conclude that W = {0, 1, 2, ...,m — 1} contains Y^,o<i<m ^(0 = /(0j m ) edges. So, 
e n (m) > /(0,m). 

As the second step, one should prove that e„(m) < /(0,m) by induction on n. Fix m and 
n for 2 < m < 2". For n = 1 the inequality is trivially true. Assume that it is true for N < n, 
which means: 

eAr(m) < /(0, m), where N <n and the fixed m is: 2 < m < 2 n (4.43) 

Now, one should check the inequality 14.431 for N = n. This means that we should find an H 
induced subgraph of Q n , \V(H)\ = m, which maximize e n (m). Let us split Q n into two (n- 
l)-dimensional cubes, the top face with 2™ _1 vertices and the bottom face with 2™ _1 vertices. 
This means, there are (n — 1)2™ -2 edges in each face, and 2™ _1 edges between two faces. Now, 
one can construct H. Choose mi vertices for H from the top face, and mi vertices from the 
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bottom face, where mi + vri2 = m and mi < m,2. In other words, H is constructed from two 
induced subgraphs, one from the top face, denoted by Hi, and the other from the bottom face, 
denoted by H 2 . 

Each face is a (n-l)-dimcnsional cube, so inequality 14.431 holds for both Hi and H^- Also, 
each vertex of the top face is connected to exactly one vertex of the bottom face. Hence, the 
number of edges of H is at most: 

e„(m) < /(0,mi) + /(0,ma)+mi(fromH33) (4.44) 

where mi, in the right hand side of the inequality, is for the maximum number of edges between 
Hi and H2, which one can choose here. Finally, by applying lemma [TJ one gets: 

e n (m) </(0,mi) + /(0,m 2 )+mi (4.45) 

< f{ m 2, m-2 + mi) + /(0, 7712) (from lemma [1]) (4.46) 

< /(0, m) (from definition of f ) (4.47) 

■ 

Theorem [5] shows that, if we want to choose an induced subgraph of Q n , with m vertices, which 
has the smallest edge boundary, then we should choose the induced subgraph of Q n with the 
set of vertices W = {0, 1, 2, m — 1}. 



Corollary 1. For all k and n, e n {k) < | \l0g2kl , which is equivalent to 6q™ (k) > k(n— \l0g2kl). 
Proof: 

Let r = \l0g2kl . Then 

2/(0, k) < /(0, k) + /(0, k) < /(0, k) + f(2 r - k, 2 r ) (bomMQ (4.48) 
= rk (from USD]) (4.49) 



Therefore: 



e n (k) = f(0,k)<r^ = ^[log 2 k] (4.50) 



4-3 Isolated components of size larger than 2 and smaller than 2 n 

Definition 20. C s is the family of s-subsets (subsets with size s) of V = V(Q n ) whose induced 
graph is connected. 

Remarks: h(n) := o(g(n)) means — > as n — > 00. 

Remarks: The following inequality will be applied a lot in the rest of this section: 

y (4 - 51) 

Theorem 9. If p n > 1 — |(logn)», the probability that for some S £ C s , 2 < s < 2™~ 1 , no 
edges of Q™^ join S to V(Q n ) \ S, as n — > 00, tends to 0. 

Note: For 2™^ 1 < s < 2", if there exist a component of size smaller than 2™ then there is at 
least one component of size smaller than 2 ra_1 which contradicts with the theorem. 
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Proof: 

Consider S C V = V(Q n ) and set b(S) = bQn(H) for which H is the induced subgraph of Q n 
with the set of vertices S. One can observe that: 

P(No edges of Q" n join S to V \ S) = (1 - Pn ) b{s) (4.52) 

In order to prove the theorem, it is sufficient to show that: 

2 n-l 

]T ]T(1- P „) b(s) = (l) (4.53) 

s=2 S&C a 

From corollary [TJ one knows that for |5| = s: 

b(S)>b(s)>s(n-\log 2 s]) (4.54) 

and therefore: 

£(l-J?n) 6(S) < 1^1(1 -P„) 6(s) (4.55) 

sec 5 

Now, one should partition s, 2 < s < 2"~ 1 , to different intervals in order to find a bound for 
\C S \ and (1 — p n ) b ^ for each interval. 

First interval 2 < s < Si, Si = L^i"J : 

First, one should find a bound for |C S |. One has maximum 2 ra choices to choose the first 
element for C s . The selected element is connected to maximum n vertices, therefore there are 
n choices to choose the second element. With the same logic there are at most (s — l)n choices 
to choose the last element for C s . Therefore, one can show: 

< 2"(n)(2n)...((s - l)n)) < (s - l)!(n) s - x 2" (4.56) 

Hence: 

\C s \(l~ Pn ) b ^ < ( s -l)!H s - 1 2"(l-p„) s( ™- riog2sl) (4.57) 

Since p n = 1 — ^(logn)~ , so for large enough n: 

(1 - Pn y<"-^ s ^ < (2)-" s (logn) s (l - p„)" s(log2 s) (neglecting some small terms) (4.58) 
= (2)-" s (logn) s 2 slo ^ s (logn)^ i ^ (4.59) 

— s log2 s 

( since for large enough n: (logn) < 1) (4.60) 

< (2)-" s (logn) s s s (4.61) 

From equations 14.571 and 14.611 one can show that: 

-Pn) b[s) < (s- l)!(?i) s - 1 2"(2)-" s (log7i) s S s (4.62) 

Assume that the right hand side of inequality 14.621 is equal to A. After multiplying both sides 
of inequality 14.621 with — and then taking log 2 from both sides, one gets: 

log 2 (|C s |(l-p„) b «^-) < l 0g2 (A^) (4.63) 
s! s! 
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If log 2 (A n;i s! +1 ) — » — oo asm oo then A " s °, +1 should tend to 0. This means that |C S |(1 — 
p n ^b{s) ns^ 1 tenc j s to o, as n -> oo. Therefore: 

„l 

|CJ(1 - p n ) b{s) < tt for large enough n (4.64) 

ns s+L 

which shows that: 

jh X)(l-Pn) i(S) =o(l) (4-65) 

s=2 sec s 

Finally, it remains to prove log 2 (A ns , + ) — > — oo as n — > oo. One can verify this for s < n and 
s > n. 



Second interval si + 1 < s < and S E C~, Si = L|^J : 
Let us define Cj~ and C+ as follows: 

Definition 21. 

CJ := {S e C s |6(s) > s(n - log 2 s + log 2 n)}, and C+ := C a \C~ (4.66) 
One can bound \C~ \ for si + 1 < s < 2"" 1 as follows: 

oris an 

< — < ( — ) s (4.67) 
s / s! s 

Hence: 

2 »-i 2 »-i 

5^ J! ^ -P») 6(5) ^ H (— -) s (-(logn)^) s ( n - lo ^ s+los =™) (4.68) 

S=Sl + l SeCs 8=81+1 

^ e 2«2-("- lo S2 s+i°S2 n) dog n l ( "~ 1o82 ° + '° E2 " > 

< ^ (— ) s (4.69) 

s=si + l 

< E ( — ) ( lo s") — 5 — ( 4 - 7 °) 

S — Si + 1 

ti(— log2 s + log2 n) 

( since for large enough n: (logn) 3 " < 1) (4-71) 

< E(^r= (l) (4.72) 

S=Sl + l 



Third interval Si < s < s 2 , si = Lt^J,^ = L^pJ and S G C s +: 

For the 3rd and the 4th intervals one needs to know how to find a bound for |C+ 1 . The following 
lemma, presented by B.Bollobas [T], helps us in this matter: 

Lemma 2. Let G be a graph of order v and suppose that A(G) < A, 2e(G) = vd and 
A + 1 <u<« — A — 1. Then, there is a u-set of U of vertices with: 

\N(U)\ = \UUT(U)\ > v±{l - exp C u{A + 1) )} (4.73) 

A v 

where, A(G) := Maximum degree in G, d := average degree in G and T(U) = {x e V(G) : 
G -E'(G) for some ye [/} 
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Let H = QrJS] (the induced subgraph of Q n with the set of vertices S). From the definition 
of C+ one knows that the average degree in H is at least: 

log 2 s - log 2 n (4.74) 

The goal is to find U C S, where \U\ := u := [77 J , A = n, v = s, d > log 2 s — log 2 n and then 
use lemma[2]to calculate \N(U)\. First, one should check the condition A + l<u<v — A — 1 
for defined variables in order to use lemma HJ First, check if n + 1 < [77 J 7 as n 00 : 

2s 2t+! . . 2t+! 

— = — — for minimum s, and trivially n + 1 < — ^— for large enough n (4-75) 

(4.76) 

and then check if [—J < s — (n + 1). One should check if ns — n(n + 1) > 2s, which means one 
should check that whether: 

TOT i 1 <« 7 > 

n J (?i + 1) 

which is clearly true for large enough n. Now, one can apply lemma [5] on the graphs generated 
by S and get : 

3U C S : |7V ([/)| > s lQg2S ' l0g2n {l - exp(-^±% (4.78) 



where: 



on the other hand: 



log 2 s - log 2 n > (log 2 (|r) - lo S2 = n-6 log 2 n 79 

rt n In 

, n — 6 log, n 1 , , 

and lim 62 = - (4.80) 



lim (l-ea;p(-^i(— + 1))) = 1 - e~ 2 (4.81) 

n— >oo s n 

Therefore, from 14.801 and 14.811 one gets: 

\N(U)\ > -(1 - e~ 2 )s > I as n -> 00 (4.82) 

Now that we have |iV(Z7)|, we can estimate a bound for |C+| here. We know from [482] that for 
each S £ C+ there exist a £7 C 5, |C/| := it := LffJ, such that \N(U)\ > s/3. Therefore, one 
can choose S £ C+ as follows: 

1. Select u vertices of Q n ; there are ( ) choices for this u. 

2. Select [_§J — u neighbors of the selected vertices in part 1; there are maximum (2™)" choices, 
since there are at most (Tj + (™) + ( 2 ) +...(") =2™ ways to find neighbors of a vertex in U. 

3. Select [^f J other vertices; there are at most choices. 
Hence: 

2"\ , , / 2" 



1 J (2 " } UfJ' 



and: 

2™ 



£ (i-p„) b(s) < ( 2 u ")(2 n r 

SGC+ V U J 



LfJ 



(1-P„) b(s) (4-84) 
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where: 

(1 -Pn) b{s) < 2- ST V(logn) s (4.85) 
consequently from 14.841 14.851 and 14.511 

E (l-Pn) b{S) < (— ) u 2™(^-)L*J2- s " S s (logn) s (4.86) 
sect U L ~ J 

Write s = 2' 9 ", ((3 = so that: 

2^<-^^/?<l- 4l0g2l ° g " (4.87) 
(log n) 4 n 

Now, find a bound for the inequality 14.861 First calculate the first part of the inequality: 

p on on on on o ofj on 

( ^_)«2«(^)L¥J < = (2 2 2 2 ( | e) | )S ^_ ( ^ ) f (4 . 88) 

e2" 2s 

( since for large enough n and s± < s < s 2 : (-2~) 3 < 1) (4.89) 

T 

= (2 2 2 2 (-e)t) s ^ = c s 2i sn ^-^ = c s 2i sn ^-^ (4.90) 
2 S3 

where c is a positive constant. Now, by substituting 14.901 in 14.861 one gets: 

E (1 - p n ) h{S) < 2- sn s s (\ogn) s c s 2i sn{1 -^ (4.91) 



seel 



and finally from 14.961 



= c s (logn) s 2 — (4.92) 

< c s (logn) s 2- 4sl °^ 10S " , (fromHSa (4.93) 
= c s (logn) s 2 log ^ logn ^ (4.94) 

< c s (logn) s (logn)^ (4.95) 
= c s (logn)^ (4.96) 



E E (! -P»)" (S) < E c s (logn)^ = (1) (4.97) 

s=s i sect 



Fourth interval s 2 + 1 < s < 2 n ~ 1 and , s 2 = L (lo | w) 4 j , S G C+: 

In Jf = <3™[S] (the induced subgraph of Q™ with the set of vertices S), the average degree 
is at least: 

log 2 s — log 2 n > n — 2 log 2 n (4.98) 
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since: 

2 n 2 n 

s > \j, d => \og 2 (- ^) < log 2 s (4.99) 

(logn) 4 (logn) 4 

2 n 

=> log 2 s - log 2 n > log 2 (- -j) - log 2 n>n- log 2 (logn) 4 - log 2 n (4.100) 

(logn) 4 

(for large enough n one can get, n > (logn) 4 ) (4.101) 

>n-21og 2 n (4.102) 

First, look for a subgraph of H with large average degree. Let T be the set of vertices of H 
with degree at least n — (log 2 n) 2 , and set t = \T\. From 14351 one can conclude that the sum of 
degrees in H is at least s(n — 2 log 2 n).We also know that: 

Sum of degrees in S < s(n — 2 log 2 n) (4.103) 

< t x (Maximum degree of vertices in set T of graph H ) (4.104) 

+ (s — t) x (Maximum degree of vertices in set S \ T of graph H ) 

(4.105) 

<tn+(s-t)(n- (log 2 n) 2 ) (4.106) 

=> t > s(l - —?—) (4.107) 
log 2 n 

Define H\ = Q n [T] = H[T] as the induced subgraph spanned by T. We want to calculate 
|^Vffi(£0l f° r some U in Hi, hence we should estimate the size of Hi and after that calculate 
the average degree in T. Let us first calculate e(Hi), the total number of edges in Hi. 

e(Hi) > e(H) - (s - t)n > -(n - 21og 2 ri) — n (from [OS] and EMi (4.108) 

2 log 2 n 

One knows that the average degree in Hi is at least 2e \ Hl > ; and: 

^1 >n-2 log 2 n - JUL- > n - -L_ (4.109) 
s log 2 n iog 2 ?i 

(since: log, n 2 < for large enough n) (4.110) 

log 2 n y v 

Set u = L"^rJ • One should check the conditions of lcmma[T]hcre. Let v = t, A = n, d > n— log 5 - . 
So, one should check if n + 1 < < < — (n + 1) for large enough n. Clearly, n + 1 < -^r, as 
n — > oo. It remains to prove < t — (n+ 1), for large enough n. For minimum s from 14. 1071 

n 2 

we can get: 



2™ 2 

t>- -j(l-- )(from|Sn3 (4.111) 

(logn) 4 log 2 n 

2 n 

> —j- (for large enough n) (4.112) 



Now, one can use lemma [T] and estimate \Nh 1 (U)\. 



\N Hl (U)\ > -fa-- ){l-exp(— ± r )} (4.113) 

n log 2 n t n 2 

t, 5 Nr , n + 12\, 
>;1-; {1-exp -} (4.114) 

2 10Ko7l t 77,2 
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After that, let us estimate a bound for exp(— ^j^-^t-)- One knows that t > s(l — log 2 - ). Since 
max(i) = s and max(s) = 2™ _1 , then: 

2™(n + l) 2 n (n+l) 2(n+l) i,„ , 

— — > — j = — — j — - > n< ( for large enough n) (4.115) 

Th -\~ 1 2 n 

=4» {1 — cxp( -)} > cxp(— n*)( for large enough n) (4.116) 

t 712 

By using the bound from 14.1 TBI in 14. ll"4l one gets: 

\N H (U)\ > \N Hl (U)\ >t{l-— — ){l-exp(-ni)} (4.117) 

log 2 n 

= t{l + cxp(-ni)- cxp(-n^) - } (4.118) 

log 2 n log 2 n 

( lim cxp(— n 3 )- = and exp(— n*) < ( for large enough n)) 

n->oo log 2 n log 2 n 

(4.119) 

> t{l ^— = t(l (4.120) 

log 2 n log 2 n log 2 n 

> s(l - r^-Xl - = s(l + 2 6 - -^—)({rommm 

log 2 n log 2 n log 2 nlog 2 ?i log 2 n 

(4.121) 

> a (l--i_) (4.122) 
log 2 n 

Now that we have | TVjj- (C/) | , we can estimate a bound for |C+| here. We know from 14.1221 that 
for each S <G C+ there exist a [/ C 5, := u := such that \N H (U)\ > s(l - r—r). 

Therefore, one can choose S € C+ as follows: 

1. Select u vertices of Q n ; there are ( 2 M ) choices for this u. 

2. Select |_s(l — log 8 n )\ — u neighbors of the selected vertices in part 1. At most (log 2 n) 2 of the 
n neighbors of a vertex in U do not belong to Nh{U). Hence there arc at most 2^(fe )(Il^=i (j)) 
ways to find neighbors of u vertices in U, where the sum is over all (fci, k%i k u ), fcj < (log 2 n) 2 . 
We know that: 

u / \ u i 

En(r>En^) (4-^3) 

(Jfe 4 ) i=l v %/ (hi) i=l 

((log 2 n) 2 r 

((log 2 n) 2 )"-f 



E n u(log 2 n) 
771 7?TT- (4-125) 

(fci 

n «(log 2 n) 2 

(Ooi^) 2 !)" 

< n «(log 2 n) 2 (4.127) 

(4.128) 



3. Select I , 8s I other vertices; there are at most f l s ) choices 
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Hence: 



(l-Pn) b{S) < f 2 jV" ( '° S2 " )2 f L g" jV-^-^^qogn)^ 1 -^) (4.129) 

(4.130) 



where: 



Therefore: 



where: 



\n(io S2 n) gs j < 2 °W (4.131) 



V l log 2 n J 



J2 (l-Pn) b(S) < 2 e(s) (4.132) 



secy 



e(s) = o(s) — s{n — log 2 s — log 2 logn + l^l-Lf l g 2 logn} (4.133) 

n 

Since s < 2™~ 1 , hence one can get: 

n — 1 

e( s ) < o(s) — s{n — (n — 1) — log 2 logn H log 2 logn} (4.134) 

= (s)-s{l--log 2 logn}< (4.135) 
n 2 

Therefore, for large enough n, one can get: 

2 „-! 2 „-i 

2 £ (l-Pn) 6(S) < E 2-«=o(l) (4.136) 

s=s 2 +l SeCf s=s 2 + l 



^.^ Comparison between a typical random graph and Q n 

Consider Per(G(V, E),p n ) as the subgraph of G(V, E) after percolation with the parameter p n . 
The goal in this section is to compare P(Q™ n is connected) with P(Per(G E G(N = 2 n ,M = 
n2 n_1 ),p„) is connected) asn->oo. 

One knows, the probability that G p E G(n,p) is connected, for p = c log n/n as n — > oo, 
tends to DP : 

[ 1 ifol; 
lim P (G p G G(n,p) is connected) = \ 1 - e" 1 if c = 1; (4.137) 

[0 if c < 1. 

Theorem 10. If Q is a convex property and J"?( 2 ) — > oo, then almost every graph in G(n,p) 
has Q iff for every fixed x a.e. graph in G(n, M) has Q, when M = Lp( 2 ) + a; (P'7( 2 ) 5 )J DP- 

Definition 22. Q is a convex property if F C G C if and F £ Q and H E Q then G E Q. 



21 



CHAPTER 4. CONNECTED RANDOM SUBGRAPHS OF THE CUBE 



We want to calculate P(Per(G m G G(n,M = m),p n ) is connected) as n — > oo. From 
theorem[lOl one can conclude that G(n,p) and G(n, M = p\o\) have almost the same behavior 
for connectivity, as n — s- oo. Hence: 

P(Per(G M G G(n,M = p^\),p n ) is connected) (4.138) 

sa P(G P G G(n,pp n )) is connected), as ra — >■ oo (4.139) 

We can calculate P{G P G G(n,pp n )) is connected) from theorem HU1 for pp n = cAogn/n, as 
n — > oo. Therefore, we should consider p n = c(n — l)logn/(2m). Finally, one can calculate 
the probability that Gm G G(n, M = m) is connected after percolation with the parameter 
p n = c(n — l)logn/(2m) as: 

{1 for pnS such that c > 1; 

1 — er 1 for p n 's such that c = 1; 
for pnS such that c < 1. 

(4.140) 

Now, let us calculate P(Per(G M G G(2 n ,n2 n - 1 ),p n ) is connected). From (|4.140[) . for p n = 
c(l — one gets: 

!1 for p^s such that c > log 2; 

1 - e _1 for p„'s such that c = log 2; 
for p„'s such that c < log 2. 

(4.141) 

Finally, it can be concluded that, limn^oo P{Per{Gu G G(2 n , n2 n ~ 1 ),p') is connected) = 1 for 
0.3 w log 2 < p' < 0.5, while lim,^,^ P(<5™, is connected) = for log 2 < p' < 0.5. Hence, a 
typical graph with 2™ vertices and n2 n ~ 1 edges is "more connected" than Q n . 
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Chapter 5 



Connected random subgraphs 
of the 3-cube 



The 3-cube graph, 3 Q n , is a simple graph with 3" vertices. If one labels each vertex of 3 Q n 
from to 3" — 1, then two vertices are adjacent if their ternary representation differs only in 
one digit. The number of edges in 3 Q n is n3™, since each vertex is connected to 2n vertices. 
A random subgraph of 3 Q n contains all vertices of 3 Q n , and each edge independently with 
probability p„. 3 Qp n stands for a random subgraph of 3 Q n . 

The main goal in this chapter is to explore a critical value p c , which for fixed values of p if 
p < p c then the probability that 3 Qp n is connected, as n — > oo, tends to 0; but if p > p c then 
the probability that 3 Qp n is connected, asn-> oo, tends to 1. We prove that this critical value 

is (V3-1)/(V3). 

In the first section of this chapter, first the probability that 3 Qp n has no isolated point as 
n —> oo, for fixed p, is investigated. It is proved that for p < (y/3 — l)/(\/3) the probability 
that 3 Qp n has no isolated point, as n — s- oo, tends to 0. Therefore, for p < (y/3 - 1)/(V3) 
the probability that Qp n is connected, as n — > oo, tends to 0. Then it is proved that, for 
p > (VS — l)/(v / 3) the probability that 3 Qp n has no isolated point, as n — > oo, tends to 1. 
In the next step, the probability that 3 Qp n has no isolated point, as n — > oo, when p depends 
on n and it is close (\/3 — l)/(v / 3), is explored. It is proved that for A(n) = A > and 
p n = 1 — (l/v / 3)A 1 / 2 "(l + o(l/n), the probability that 3 Qp n has no isolated point, as n — > oo, 
tends to e _A . Finally, it is proved that, for fixed p = — l)/(y/3) the probability that 3 Qp n 
has no isolated point, as n — > oo, tends to e _1 . 

In the second section, one sheds light on the Isoperimetric problem, which is the problem 
of finding an inequality which relates the size of a subgraph to the size of its boundary. One 
needs such an inequality to explore the probability that 3 Qp n has a component which is not 
the whole graph. 

In the last section, the Isoperimetric inequality is applied to prove that when p depends 
on n and p n > 1 — (l/v / 3)(logn) 1 /™, then the probability that there are no components with 
size larger than 1 and smaller than 3™ in 3 Qp n , as n — > oo, tends to 1. Therefore, for 
p n — 1 — (l/v / 3)A 1 / 2ri (l + o(l/n)), the probability that 3 Qp n is connected, as n — > oo, tends to 
e~ A . Finally, as a special case, it is shown that, for fixed p if p = (v3 — 1)/ (a/3), the probability 
that 3 Qp n is connected , as n — > oo, tend to e _1 ; and if p > (\/3 — l)/(v / 3), the probability 
that 3 Qp n is connected, as n — > oo, tends to 1. 

1 The proof presented for the 3-cube, in this chapter, is independent of the proof presented in the previous 
chapter; therefore, some parts of the proofs overlap each other. 
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5.1 Isolated vertices 
For p n < 

First, one should consider the following definitions: 
Definition 23. J„(p„):=P( 3 Q™ re is connected) 
Definition 24. g n (p n ) := P( 3 Q" n contains an isolated point) 

f 1 Vertex i is isolated, i £ V( 3 Qp n ); 



t~» g -4.- oc v i \ J 1 vertex i is isoiaieu, i t v -y. j , „, > \- „ 

Definition 25. Aj(n) := < _ „ . . . at ^t, . , , , . ^K/n^n ^ ,andl n := > X 
y ' Vertex i is NOT isolated, i G VrQV; ). ' v ; ^ 



Now, calculate _E[X(n)] and Var[X(n)] as follows: 



/x := E[X(n)} = £ SpQ(n)] = ]T (1 - p) 2 " = 3"(1 - p) 2 " (5.1) 
<eV(3Q» n ) ie y(3 Q?n ) 

7ar[X(n)] = ^ Vor[Xi(n)] + Cou[Xi(n), Xj(ra)] (5.2) 

where, Far[Xi(n)] and Coi>[Xj(n), X,(n)] are equal to: 

£ Var[JQ(n)] = 3"(1 - p) 2n - 3 n (1 - p) 2n (1 - p) 2n = fi - fx(l - p) 2n (5.3) 

Coupon), ^-(n)] = £;[Xi(n)X,-(n)] - £pQ(n)],E[Xj-(n)] (5.4) 

{0 i,j not adjacent; 

(1 - p) 2 "(l - p) 2 "" 1 - (1 - p) 2 "(l - p) 2 " = ^(^) i,j adjacent. 

(5.5) 

and finally: 

Var[X(n)] = M - m(1 - p) 2 " + ^(^-) = M + m(1 - p) 2 "(^- - 1) (5.6) 

J™ 1-p 1 — p 

Now, since we have Var[X(n)], we can use Chebyshev's inequality to estimate g n {p). Cheby- 
shev's inequality states that: 

1 - g n (p) = P[X(n) = 0] < P[\X(n) - M | > A*] < Var[X{n)] (5.7) 

By applying Chebyshev's inequality when p < (vo — l)/(-\/3), one gets yar[AT(n)]//i 2 — ^ 0, 
as n — > oo. Therefore lim„_ i . 00 g„ (p) = 1. And finally, since f n (p) < 1 — 9n(p), then for 
p < (V3 — 1)7(^3) the probability that 3 Qp n is connected, asn^ co, tends to 0. ■ 

For p n > 

Assume p is fixed and p > (y/3— 1)7(^3)- In order to calculate g n (p) when p > (-\/3— 1)7(^3), 
as ti — )■ oo , one can use the following inequality: 

g n (p) = P[X(n) > 0] < E[X(n)} = p (5.8) 
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Since E[X(n)] — > as n —> oo, then lim,,^,^ g n {p) = 0. This means that the probability that 
there are no isolated points in 3 Qp n for p > (\/3 — l)/(\/3), as n — > oo, tends to 1. ■ 

For p n = 1 - (l/V3)\ 1/2n (l + o(l/n)): 

One needs the following theorem from [T] to find the distribution of X(n) (distribution of the 
number of isolated points). 

Theorem 11. Let A = A(n) be a non-negative bounded function on N. Suppose the non- 
negative integer valued random variables X(1),X(2), ... are such that: 

lim {E r [X(n)\ - A r } =0, r = 0, 1, ... (5.9) 

where £? r [-^*] is the rth factorial moment of X, i.e. = E[(X) r ]. Then 

X{n)^V x (5.10) 

Use the definition of X(n) presented in definition |2~51 The goal is to calculate E\X{ri)\. 

E r [X(n)\ = E{X(ri)(X(ri) - l)(X{n) - 2)...{X{n) - r+ 1)] (5.11) 

Since X(n) := X)iey(Q") Xi(n) and X^s are indicator functions, therefore: 

X{n)(X(n)-l){X{n)~2)...{X(n)-r + l)= ]T X n X l2 ...X lr (5.12) 

(ii,i2j— )»r) 

where the sum is over all ordered sets of distinct vertices. Then: 

E r [X(ri)} = E[X{n){X{n) - l){X{n) - 2)...{X{n) - r + 1)] (5.13) 
= E[ J2 X n X l2 ...X lr ] (5.14) 

(h,i 2 ,...,ir) 

= J2 P[X il = l,X ia = l,...,Xi r = l] (5.15) 

One knows that a set of r vertices is incident with at most 2rn edges. There are (r) r ( 3 r ) ways 
to choose such r vertices. Hence: 

E r [X(n)\ > (r) r ( 3 r ")(l - Pn) 2 ™ = (3 n )r(l - Pn) 2rn (5.16) 

One the other hand, a set of r vertices is incident with at least r{2n—r) edges. There are at most 
(r— l) r _i( (r — l)2n ways to choose a set of r vertices in 3 Qp n where at least two vertices are 
adjacent; since if we choose r — 1 vertices independently, then the last vertex must be connected 
to one of the chosen vertices. In other words, there are at most (r — l) r —i (__])(*" — l)2n ways 
to choose r vertices which some of them are adjacent to each other. Hence: 



E r [X(n)} < (3") r (l - A ) a ™ + (r - l) r _! j 2(r- - l)n(l - p,,)^ 2 ^' (5.17) 

< (3") r (l - Pn ) 2rn + (3") r 2m(l - p n ) r ^ n - r ^ (5.18) 

< (3 n ) r (l - Pn) 2rn + 3 ,l(r - 1) 2rn(l - Pn ) r{2n - r) (5.19) 
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Finally from 1 5 . 1 61 and l5~Ti?l one gets: 

(3 n ) r (l-p n ) 2rn <E r [X(n)} < {T) r {l-p n f rn + ^ r - l Hrn{l-p n ) r{ - 2n - r) (5.20) 

(5.21) 

which gives: 

(3(1 - p n ) 2 ) rn {l - < E r [X{n)] < (3(1 - Pn ) 2 Y n {\ + 3" n 2rn(l - Pn )- r2 } (5.22) 

Since r is fixed and lim„_ i . 00 (3(l — p n ) 2 ) n = A, then: 

lim (E r [X(n)]) = A r for r=0,l,2,... (5.23) 

n— J-oo 

This shows that X(n) -A P A . ■ 



For p = 0.5: 

In the calculation ofp n = 1 - l/\/3A 1 / 2 "(l +o(l/n)), if we fix p= (a/3- 1)/(a/3) and let A = 1, 
then we get that the distribution of X(n) , as n — > oo, tends to a Poisson distribution with 
mean 1. Therefore, one can conclude: 

lim (1 - g n (p)) = lim (P(X(n) = 0)) = e" 1 (5.24) 

n— >oo n— foo 

This shows that for p = (\/3 — 1)7(^3) the probability that 3 Qp n has no isolated point, as 
n — > oo, tends to e _1 . ■ 

5.2 Isoperimetric problem for the 3-cube 

One needs an inequality which relates the size of a subgraph of 3 Q n to the size of its boundary. 
This inequality will be applied to prove that for p > (a/3 — l)/(\/3), the probability that 
subgraphs of 3 Q n do not have a component of size larger than 2 and smaller than 3", as 
n — > oo, tends to 1. 

Definition 26. The edge boundary bc{H), where H is an induced subgraph of G, is the number 
of edges which joins vertices in H to the vertices in G\H. 

The main task in this section is to calculate &3gn(fc). Since 3 Q n is 2n-regular and H is an 
induced subgraph of G with |1^(H)| = m, then: 

bsgn (H) = ran — 2e(H), where e(H) is the total number of edges in H. (5.25) 

bsQ n (k) = mn — 2e„(fc), where e n (k) = max{e(H) : H induced subgraph of 3 Q™, |^(#)| = m}. 

(5.26) 



Definition 27. h(i) := sum of digits in the ternary expansion of i and /(/, m) := h(i) 
Lemma 3. If 1 < k < I, then f(l, l + k) > /(0, k) + k 
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Proof: 

Look at the ternary expansion of a few numbers: 

Column 2 10 

Bin\Dec 3 2 3 1 3° 



1 1 

2 2 

3 1 

4 1 1 

5 1 2 

6 2 

7 2 1 

8 2 2 

9 10 

10 10 1 

11 10 2 

12 110 

13 111 

14 112 

15 12 

16 12 1 

17 12 2 

18 2 

19 2 1 

20 2 2 

21 2 10 

From this representation, one can observe that column i starts with a block of 3 1 zeros. 
Therefore, sum of jth digits of k numbers, one after the other, is minimal if the first block of 
0's is as long as possible. Hence, one can conclude: 

f(l,l + k)>f(0,k) (5.27) 

For every i define r such that < i < 3 r — 1 . The ternary expansions of i and 3 r — 1 — i are 
symmetric. This means that, if there is a 2/0 in an specific location of the ternary expansion 
of i then there is a 0/2 in the same location of the ternary expansion of 3 r — 1 — i; and if there 
is a 1 in an specific location of the ternary expansion of i then there is a 1 in the same location 
of the ternary expansion of 3 r — 1 — i. Therefore, 

h(i) + h{3 r - 1 - i) = 2r for < i < 3 r - 1 (5.28) 

Consequently, since: 

Hi) + = 2rk (5.29) 

l<i<l+k 3 r ~l~k<i<3 r -l 

then: 

f(l, l + k) + ,/(3 r - I - k, 3 r - I) = 2rfc, where I + k < 3 r (5.30) 

One should first prove lemma [3] with the assumption k < 3 ? * < / by using \E7ZT\ and ??. This 
assumption means that the length of the sequence of 0's, l's and 2's in the ternary expansion 
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of 3 r + k and 3 r are equal. 

With the same logic that one gets 15.271 one gets: 

f(l, l + k)> /(3 r , 3 r + k), when 3 r < I (5.31) 
and then for k < 3 r one can get: 

f(3 r ,3 r + k)= ]T fe (0 ( 5 - 32 ) 

3 r <i<3 r +fc 

E3 r <t<3 r +fc ^(*) ^ S ^ ne sum over numbers with the same length in their ternary expansion's 
sequence. When one removes the last digit in their ternary expansion, what remains is /(0, k). 
Therefore, X^^io^+fc M*) i s equal to sum of the last digits plus f(0,k). Hence: 

/(/, I + k) > f(3 r , 3 r + k) = k + /(0, k) when k < 3 r < I (5.33) 

and finally: 

f(h l + k)> /(0, k) + k where k < 3 r < I (5.34) 

Lemma |31 is proved with the assumption k < 3 r < I. Now, one should prove lemma [3] without 
this assumption. The proof is based on induction on K. We want to prove that for 1 < K < I, 
f(l + K, I) > K + /(0, K). Fix k such that 1 < k < I and K < k. For K = 1 the inequality is 
trivial. Assume that the inequality is true for K < k and K > 2, which means: 



f(l, l + K)> K + /(0, K) when 1 < k < I , and K < k (5.35) 

Now, one should prove the inequality for K = k. Define r > 1 by 3 r_1 < k < 3 r . If / > 3 r , then 
k < 3 r < I and the lemma is implied by inequality 15.341 One may assume that 3 r_1 < I < 3 r . 
Finally, one should apply inequality 15.301 and 15.351 in order to get the final result: 

/(/ + k) = f(l, 3 r ) + /(3 r , I + k) (from definition off and I > 3 r ) (5.36) 

= (3 r - l)2r - f(0, 3 r - I) + f(3 r , I + k) (from[5JW (5-37) 

> (3 r - l)2r - /(0, 3 r - I) + /(0, 1 + k - 3 r ) + I + k - 3 r (from\EM (5-38) 

> (3 r - l)2r - f(3 r - k, 3 r - k + 3 r - I) + 3 r - I + /(0, 1 + k - 3 r ) + I + k - 3 r (from\K 

(5.39) 

> (3 r - l)2r - f(3 r - k, 3 r - k + 3 r - I) + /(0, 1 + k - 3 r ) + k (5.40) 
>f(l + k- 3 r , k) + /(0, 1 + k - 3 r ) + k (from Uum (5.41) 

> f(0,k) + k (from characteristics of f) (5-42) 



Theorem 12. For 2 < m < 3™ we have &3Qn(m) = mn — 2/(0, m). In other words, /(0, m) = 
e n (m) where e„(m) = max{e(7J) : H induced subgraph of 3 Q n \V(H)\ = m). 

Proof: 

First, let us fix an m. As the first step, one should prove that e„(m) > /(0,m). Vertex i 
is connected to h(i) vertices j with j < i, since for each 1 (2) in the ternary expansion of i 
there is exactly one j (j < i), which its ternary expansion differs in the position of that 1 (2). 
Therefore, on can conclude that W = {0, 1, 2, to— 1} contains Ylo<i<m M*) = m ) edges. 
So, e„(m) > /(0,m). 
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As the second step, one should prove that e n (m) < /(0,m) by induction on n for fixed m. 
For n = 1 the inequality is trivially true. Assume that it is true for N < n, which means: 

ejv(ra) < /(0, m), where N < n and the fixed m is 3 < m < 3™ (5.43) 

Now, one should check the inequality 15.431 for N = n. This means that we should find an H 
induced subgraph of 3 Q n , |V(if)| = m, which maximize e„(m). Let us split 3 Q n into three 
(n-l)-dimensional cubes, face-1, face-2 and face-3 each with 3 n_1 vertices and (n— l)3 n_1 edges. 
Now, one can construct H. Choose mi vertices for H from face-1, 777,2 vertices from face-2, and 
7773 vertices from face-3 where mi + 777,2 + = m and mi < mi < r«3 and mi + 7772 < m3. In 
other words, 77 is constructed from three induced subgraphs, denoted by Hi, Hi, and H3. 

Each face is a (n-l)-dimcnsional cube, therefore inequality 15.431 holds for Hi, H% and H3. 
Also, each vertex of each face is connected to exactly on vertex from one face and one vertex 
from the other face. Hence, the number of vertices of H is at most: 

e„(m) < /(0, mi) + /(0, m 2 ) + /(0, m 3 ) + 2m x + m 2 (5.44) 

2?77i + 77i2 is the maximum number of edges between three faces that one can choose here. 2mi 
is the maximum number of edges between chosen vertices in H\ and Hi plus the maximum 
number of edges between chosen vertices in Hi and H3. Consequently, m 2 is the maximum 



number of edges between chosen vertices in Hi and H3. Therefore: 

e„(m) < /(0, mi) + /(0, m 2 ) + /(0, m 3 ) + 1m x + m 2 (framES) (5.45) 

< f(mi, mi + mi) + /(0, m 2 ) + mi + 7712 (from lemma [3]) (5.46) 

< ,/(m3, 7773 + 7772 + mi) + /(0, 7773) (from lemma[3]) (5-47) 

< ,/(0, m) (from definition of f) (5.48) 



Theorem [T2l shows that, if we want to choose an induced subgraph of 3 Q n , with m vertices, 
which has the smallest edge boundary, then we should choose the induced subgraph of 3 Q n 
with the set of vertices W = {0, 1, 2, m — 1}. 



Corollary 2. For all k and n, e n (k) < k\logsk~\, which is equivalent to baQ n (k) > 2k(n — 
\logzk~\). 

Proof: 

Let r = [log 3 k] . Then 

2/(0, k) = /(0, k) + /(0, k) < f(0, k) + f(3 r - k, 3 r ) (fromESJ (5.49) 

= 2rfc (from[Oni) (5.50) 

Therefore: 

e n (k) = /(0, fc) < rk = k\log 3 k] (5.51) 



5. 3 Isolated components of size larger than 2 and smaller than 3 n 

Definition 28. C s is the family of s-subsets (subsets with size s) of V = V( 3 Q n ) whose induced 
graph is connected. 
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Remarks: h(n) := o(g(n)) means — > as n — > oo. 

Remarks: The following inequality will be applied a lot in the rest of this section: 

<*> (5 - 52) 

Theorem 13. If p„ > 1 — ^j(logn)", the probability that for some S £ C s ,2 < s < 4p , no 
edges of 3 Q" n join S to y( 3 Q Tl ) \ S, as n -» oo, tends to 0. 

3" 



Note: For 4p < s < 3" , if there exist a component of size smaller than 3™ then there is at least 
one component of size smaller than ^- which contradicts with the theorem. 
Proof: 

Consider S C V = V( 3 Q n ) and set b(S) = 6g»(i?) where if is the induced subgraph of 3 Q n 
with the set of vertices S. One can observe that: 

P(No edges of 3 Q" n join S to V \ S) = (1 - Pn ) b{S) (5.53) 

In order to prove the theorem, it is sufficient to show: 

£ £(l-p n ) b(S) = (l) (5.54) 

s=2 sec s 

From corollary [3 one knows that for \S\ = s: 

b(S) > b(s) > 2s(n - [log 3 s\) (5.55) 

and therefore: 

Y t 0-~Pn) KS) <\Cs\0--Pn) Ka) (5.56) 

sec 

One may partition s, 2 < s < 4^-, to different intervals in order to find a small enough bound 
for \C a \ and (1 - Pn ) b ^ . 

First interval 2 < s < Si, Si = [^t\ ■ 

First, one should find a bound for |C S |. One has maximum 3™ choices to choose the first 
element for C s . The selected element is connected to maximum 2n vertices, therefore, there 
are maximum 2n choices to choose the second element. With the same logic, there are at most 
2n(s — 1) choices to choose the last element for \C S \. Hence, one can show: 

< 3 n (2n)(2n(2))...(2n(* - 1)) < (s - l)!(2n)' _1 3 n (5.57) 

and: 

|C S |(1 -p„) 6(s) < (s-l)!(2n) s - 1 3"(l-p„) 2s( "" riog3sl) (5.58) 
Since p n = 1 — -^(logn)", so for large enough n: 
(1 - Pn )Mn-[iog 3 s]) < (3)-™* (log n ) 2s (1 - J , n )- 2s ( 1 °S3s) (neglecting some small terms) (5.59) 

-2a log 3 s 

( since: (logn) ™ < 1 for large enough n) (5.60) 

= (3)~ KS (log n) 2s 3 slog 3 8 (logn) ' (5.61) 

< (3)-" s (logn) 2s s s (5.62) 
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From equations 15.581 and 15.621 one gets: 

\C S \(1- Pn ) b ^ < (s- l)!(2n)^ 1 3"(3)-" s (logn) 2s S s (5.63) 

Assume that the right hand sides of inequality 15.631 is equal to A. After multiplying both side 
of inequality 15. 631 with 2™^ — and then getting log 3 from both sides, one gets: 

Iog 3 (|C.|(l - Pn )»W£^i_) < lo g3 (A^^) (5.64) 

If log 3 (A 2 "^ +1 ) -> — oo as n -> oo then A 2 "^ +1 should tend to 0. This means that |C S |(1 — 
Pn) fc(s) 2 "si +1 tends to 0, as n ~> oo. Therefore: 

2?is 

which shows that: 



|C S |(1 - p n ) b{s) < - S \, for large values of n (5.65) 



£ £(1-^)^=0(1) (5.66) 

s=2 sec. 

Finally, it remains to prove log 3 (^4 2 "^ + ) — > — oo as n — > oo. One can verify this for s < n and 
s > n. ■ 



Second interval s% + 1 < s < ^ and S £C~, si = : 
Dchnc Cj" and Cf as follows: 

Definition 29. 

C7 := {5 e C s \b(s) > 2s(n - log 3 s + log 3 n)}, and C+ := C B \C- (5.67) 
One can bound \C~ \ for si + 1 < s < 3? as follows: 

>n\ Qns e3 n 

s y s! s 



|C7|<|C S |< ( . ) < — <( — Y (5.68) 



Hence: 

3^ 3* 



£ ]T (l-p„) b(5) < ^ (— ) S (V3 \lo g n)^f< n - lo ^ s+1 °^^ (5.69) 

s=si + l s=si+l 



< £ ( e3"3-("- lo g3 S +i° S3 ») (log „) ^gg±l^ -> ^ ^ 

S=Sl + l 

^ ,e3™3-™3 lo S3 s 3- lo S3™(l gn) 2 x 3 n C -io M . + io Ea „) 

< 2-i ( s ^^) s (logn) • (5.71) 

s=si + l 

(since for large enough n: (logn) = < 1) (5-72) 



,e(log?i) 2 

s=si+l 



< E C-^r = o(l) (5.73) 
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Third interval Sl < s < s a ,*i = l^-\,s 2 = L^yd and S e C7+: 

For the 3rd and the 4th intervals one needs to find a bound for |C+|. The following lemma, 
presented by B.Bollobas [I], helps us in this matter: 

Lemma 4. Let G be a graph of order v and suppose that A(G) < A, 2e(G) = vd and 
A + 1 < « < u — A — 1. Then, there is a u-set of U of vertices with [1]: 

\N(U)\ = \UU T(U)\ > - cxp( ~" (A + 1) )} (5.74) 

A v 

where, A(G) := Maximum degree in G, d := average degree in G and T(U) = {x <E V(G) : 
xy e E(G) for some y e U} 

Let if = 3 Q n [S] (the induced subgraph of 3 Qn with the set of vertices S) . From the definition 
of G+ one knows that the average degree in H is at least: 

2(log 3 s - log 3 n) (5.75) 

The goal is to find U C S, where \U\ := u := |_„J) ^ = ^ n > v = s , d > log 3 s — log 3 n and 
then use lemma|4]to calculate the boundary size of U, \N(U)\. First, check if 2n + 1 < [77 J , as 
n — > 00: 

2s 23™/ 2 23"/ 2 

— = — ^ — for minimum s, and trivially 2n + 1 < — 5 — , for large enough n (5.76) 

n n A n A 

(5.77) 

and then check if [—J < s — (2n + 1). One should check if ns — n(2n + 1) > 2s which means 
one should check that whether: 

n i {2n + 1) 

which is clearly true for large enough n. Now, one can apply lemma 2] on the graph generated 
by S and get: 

, . , 2(logo s — log, n) r , u(2n + l)., 

3U C S : N(U) > s V 83 „ ^-^{1-exp i '-)} 5.79 

2n s 



where: 



2(log 3 s-log3 n) > (log 3 (^)-log3 n) = f -31og 3 n 
2n n n 

f-31og 3 n 1 
which lim ± 23_ = (5.8i) 

n— >oo n 2 



on the other hand: 



lim (1 - cxp(-^±i(— + 1))) = 1 - e" 4 (5.82) 

n— s-oo s 71 



Therefore, from 15.811 and 15.821 one gets: 



\N(U)\ > -(1 - e~ 4 )s > ^ as n -> 00 (5.83) 
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Now that we have \N(U)\, we can estimate a bound for |C+| here. We know from l5.83"l that for 
each S £ C+ there exist a U C S, \U\ := u := [^fj, such that \N(U)\ > s/3. Therefore, one 
can choose S € Cf as follows: 

1. Select u vertices of 3 Q n ; there are ( 3 M ) choices for this u. 

2. Select |_§ J — u neighbors of the selected vertices of u in part 1; there are maximum (2 n ) u 
choices, since there are at most ( 2 ™) + ( 2 ™) + ( ^) + •■■(2™) = 2 2n ways to find neighbors of a 
vertex in U. 

3. Select [^f J other vertices; there are at most (,3,,) choices. 
Hence: 



3 

W v " ' VLfJ 



|C+|< ( )(2 2 T( |2sl ) (5-84) 



and: 



T (l-^)" (S) < (t)( 22 T( L rj)(l-fn) b(8) (5-85) 



sec 



where: 



(1 ~Pn) Hs) < 3- sn s s (\ogn) 2s (5.86) 
consequently from 15.851 15.861 and 15.521 

V (1 -Pn) b[S) < (^)"2 2 ™(^)L*J3- s " S s (logn) 2s (5.87) 
sec+ L 3 J 

Write s = S* 3 ", (fi = ^), so that: 

3^<7r^^/3<l- 7l0g3l ° Sn (5-88) 
(log n) ' n 

Now, find a bound for the inequality 15.871 First calculate the first part of the inequality: 

( — ) u ? un {^7) m < )-3 2s 2 4 ^)* (5.89) 

e3™ 2s 

(since for large enough n and s\ < s < s 2 : (^-)~ < 1) (5.90) 

n 

< (3 2 2 4 (-e)§)^ = tftf"* 1 -^ = c s 3i s "( x -« (5.91) 
2 S3 

where c is a positive constant. Now, by substituting 15.91 1 in 15.871 one gets: 

J2 (l-Pn) HS) < 3- s ™s s (logn) 2 V3i ;m(1 ~ /3) (5.92) 



sec? 



= c s (logn) 2s 3 3 (5.93) 

< e s (logn) 2;i 3~ 7sl0 ^ 108 " , (fromEBQ (5.94) 

= c^logn) 2 ^ 108 ^ 108 ™'^ (5.95) 

< c s (logra) 2s (logra)^ (5.96) 
= c s (logn)^ (5.97) 
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and finally from 15.971 



E E (! -P»)" (S) < E c s (logn)T = (i) (5.98) 

s=s i sect 



Fourth interval s 2 + 1 < s < ^ and , s 2 = L ([„!„)» J , 5 £ C+: 

In H = 3 Q n [S] (the induced subgraph 3 Q n with the set of vertices S), the average degree is 
at least: 

2(log 3 s - log 3 n) > 2(n - 2 log 3 n) (5.99) 

since: 

3 n 3 n 
-^r^^WW^sXloga- (5.100) 

3 n 

=> log 3 s - log 3 n > lo g3( ^ log ^ 9 ) - lo 8'3 n>n- log 3 (logn) 9 - log 3 n (5.101) 

(for large enough n one can get n > (logn) ) (5.102) 

>n-21og 3 n (5.103) 

First, look for a subgraph of H with large average degree. Let T be the set of vertices of H with 
degree at least 2{n — (log 3 n) 2 ), and set t = \T\. From ISTMl one can conclude that the sum of 
degrees in H is at least s(n — 21og 3 n). We also know that: 

Sum of degrees in S < 2s(n — 2 log 3 n) (5.104) 

< i x (Maximum degree of vertices in set T of graph H ) (5.105) 

+ (s — t) x (Maximum degree of vertices in set 5 \ T of graph H ) 

(5.106) 

< 2tn + 2(s-t)(n- (log 3 n) 2 (5.107) 

4 

=^i>s(l-- ) (5.108) 

log 3 n 

Define Hi = 3 Q n [T] = H[T] as the induced subgraph spanned by T. We want to calculate 
|^Vffi(f )| m Hi, hence we should estimate the size of Hi and after that calculate the average 
degree in T. Let us first calculate e(Hi), the total number of edges in Hi. 

e(Hi) > e(H) - 2(s-t)n > 2-(n-21og,n) - r-^—n ( from IB^l and IBTTMl) (5.109) 

2 log 3 n 

One knows that the average degree in Hi is at least 2e \ Hl > ; and: 

^^>2(n-21og 3 n)--^>2n-^— (5.110) 
s log 3 n log 3 n 

(since: log 3 n 2 < for large enough n) (5.111) 

log 3 n 

Set u = L"^rJ- O ne should check the conditions of lemma [3] here. Let v = t, A = 2n,d > 

n 2 

2n — lo<r 9 - . So, one should check if 2n + 1 < < t — 2(n + 1), for large enough n. Clearly, 

°3 n "2" 
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2n + 1 < as n — > oo. It remains to prove -^-r < t — 2{n + 1), for large enough n. For 
minimum s from l5.108l we can get: 

3 I: 



t > 



7 Cl- 



io, 



§3 



-) rfrom ona 



(log n) 
3 n 

> — - + 2(n + 1) (for large enough n) 



Now, one can use lemma [3] and estimate \Nh 1 {U)\. 



1^(^)1 > 7^(2n 
1 iy n ~ 2n log 3 n 



9n , M , 2n+l 3" 
){l-exp( r )} 



2 log 3 n 



9 U1 , 2n + l 3" 
){l-exp( -)} 



712 



Let us estimate a bound for exp(— 2 " +1 -^r). One knows that t > s(l- 



Since max 



and max(s) = 4^-, then: 



3 B (2n+l) ^ 3 n (2n + l) (2n + 1) i 

; > ; = ; > n 4 (for large enough n) 



772 t 



77 2 3 ,; 



n 2n + l 3" . . i. . , . 

{1 — exp( -)} > exp(— 77 4 ) (for large enough n) 



t 772 

By using the bound from 15 . lTTI in 15 . 1T5I one gets: 
\N H (U)\ > |iV(^)| > \{2 - CX P (-77^)} 



i i 9 i 9 

= ;:{2 + cxp(-77 4 )- 2exp(-77 4 ) - — — } 



log 3 77 



l0g 3 77- 



i 9 i 1 
( lim exp(— 77 4 )- = and exp(— n 4 ) < (for large enoug 



log 3 n 



>-{2 

- 2 X 

s , 
> - 1 - 

- 2 V 



>-(2 
- 2 V 



l0g 3 77 l0g 3 77 

)(2- 



11 



l0g 3 77' 



l0g 3 77 



l0g 3 77' 



£ (2 
2 V 



log 3 7i 



11 



15 



l0g 3 77 l0g 3 77 l0g 3 77 



(for lar. 



l0g 3 77 



l0g 3 77 



5.112) 

5.113) 

5.114) 
5.115) 

t) = a 

5.116) 
5.117) 

5.118) 

5.119) 

in)) 

5.120) 

5.121) 

ge enough n) 
5.122) 
5.123) 



Now that we have \Nu(U)\, we can estimate a bound for |C^"| here. We know from 15. 1231 that 
for each S G C+ there exist a U C S, \U\ := u := [^-J, such that \N H {U)\ > s(l - ^-). 

Therefore, one can choose S € C+ as follows: 

1. Select u vertices of 3 Q n ; there are ( 3 u ) choices for this u. 

2. Select [s(l — lo I' 5 3 )J — u neighbors of the selected vertices in part 1. At most 2(log 3 77) 2 
of the 2ti neighbors of a vertex in U do not belong to Nh{U). Hence there are at most 
S(fcj) (IliLi Cj 1 )) wavs to find neighbors of u vertices in U, where the sum is over all (fci, k%, k u ), ki < 



35 



CHAPTER 5. CONNECTED RANDOM SUBGRAPHS OF THE 3-CUBE 



2(log 3 n) 2 . We know that: 



(fci) i=l V 1 ' (k 4 ) t=l 

X (2n)( 2 ( log 3" 



,(2n) 



"§S ( "( 2 ( lo g3«) 2 ) ! 

= E 

(*<) 

= (2(log 3 n) 2 ) 



(2n) 2u ^ log3 
((2(log 3 n) 2 )!)« 



2 „ (2n) 2u ( lo S3' 



((2(log 3 n) 2 )!)* 



< (2n) 2 " (log3Tl)2 



3. Select I , 7,5 I other vertices; there are (. ? 5 ,") choices. 
Hence: 

J2 (l-P«) b(S) < P")(2n) 2 "( log 3") 2 f f 5 V2»(n-l° 83 0(lo gn )2«(l-^) 
g£Q+ ^ ' \Llog 3 ra-l/ 



where: 



Therefore: 



where: 



£ (1-P«) 6(S) <3' 



e( s ) 



sec; 



e(s) = o(s) - 2s{n - log 3 s - log 3 log n + l g 3 log n} 



Since s < 4-, hence one can get: 



n - 1 

£{ s ) < °( s ) — 2s{n — (n — 1) — log 3 logn H log 3 logn} 

= o(s) — 2s{l — — log 3 logn} < — s 



(5.124) 
(5.125) 
(5.126) 

(5.127) 

(5.128) 
(5.129) 



(5.130) 
(5.131) 

(5.132) 
(5.133) 

(5.134) 

(5.135) 
(5.136) 



Therefore, for large enough n, one can get: 



£ £(1-a0'W< £ 3- = o(l) 
s=s2+i sect S=S 2 +1 



(5.137) 
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Chapter 6 

Connected random subgraph of 
the p3-product 



The P3 1 graph is the cartesian products of n copies of P3, where P3 stands for a path with length 
2. If one labels each vertex of P£ from to 3™ — 1, then two vertices are adjacent if the difference 
between their ternary representation is 1, in other words, vertex x = (xi, X2, x n ) is connected 
to the vertex y = (j/i, 1/2 , y n ) if for some i we have \xi — yi\ = 1 and Xj = Xj for j ^ i. Vertex 
x = (xxi x 2, •■•) %n) is connected to n + i vertices where i = \{j\xj = 1, j € {1, .., n}}\. A random 
subgraph of P£ contains all vertices of Pf, and each edge independently with probability p. p 
is called the percolation parameter and Pg p stands for the random subgraph of P3 1 

The main goal in this chapter is to explore a critical value p c , which for fixed values of p if 
p < p c then almost no random subgraphs of P3™ is connected, as n — > 00; but if p > p c then 
almost all random subgraphs of P3 1 are connected, as n — > 00. We suggest that this critical 
value is 2 — The proof in this chapter is not complete and there is a place for further work. 

In the first section of this chapter, it is proved that for p < 2 — \f2 almost no random 
subgraphs of P£ arc connected, as n — > 00. Then, it is proved that, for p > 2 — \pl almost 
all random subgraphs of Pf 1 have no isolated point, as n — s- 00. In the second section, the 
probability that random subgraphs of P£ , with the percolation parameter p > 2 — V2, have no 
components with size larger than 1 and smaller than 3™, as n — ¥ 00, is explored. 



6.1 Isolated vertices 
For p < 2 - y/2 : 



Let us define Xi and X for a graph G as follows: 

n a -4.- on v t \ f 1 Vertex i is isolated, i € V(G); , „, > „ , , 

Definition 30. A,(n) := < . J .. „„„ . , , , . Trf/ -r, ,andA(n):= > a 8 n 

y ' 10 Vertex z is NOT isolated, 1 € V(G). v ; ^ v 7 

As the first step we calculate E[X(n)]. One can categorize the set of vertices V into n + 1 
subsets ViS where x = [x\, X2, x n ) € Vi if = l,j 6 {1, .., n}}\ = i. Hence: 

n n j \ 

ft := E[X(n)) = £ E[X 3 {n)] = £ £ ^W**)] = E ( J 2 ""^ 1 = ^ ~ ^"( 3 " 

j'e^(P 3 %) i=ofceVj »=o ^ ' 

(6.1) 
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From (|6.1[) . the threshold value for E[X] is p c = 2 — \[2 which is the solution to the equation 
(1 - p)™(3 - p) n = 1. This means that E[X] -> oo for p < p C! but ] -> for p > p c . Now, 
one should calculate Var[X]. 

Var[X{n)]= ^ Var[Xi(n)}+ ^ Cov[Xi{n), X^n)] (6.2) 

n 

= E E Var[X k (n)] + ^ Cov[Xi(n), Xj(n)] (6.3) 
»=o fcev, i,jev(i^ p ),i/j 



where: 



n n / \ 

£ ^ Far[X fc (n)] = £ (") 2»" < ((1 - p)** - (1 - p) 

\2n/o i /I „\2^ 



(6.4) 



= (1 - p)"(3 - p) n - (1 - p) 2 "(2 + (1 - p)^)" (6.5) 

In order to calculate jev(P n ) i^j Cov[Xi(n), Xj(n)] one should know that if x £ Vi then x 
is incident to 2i vertices in Vi-% and n — i vertices in Vi+\. Also, one knows: 

Cov[X l (n),X :j (n)} = £'[X i (n)X 7 -(n)] - £[Xi(n)]£;[X,-(7i)] = if i,j not adjacent (6.6) 

Hence: 

J2 CouiXitn^Xjtn)] (6.7) 
i,jev(Pi p ),i^j 

= (^{(b-^I -p)" +i - 1 (l-p)"+ i+1 + 2i(l -p)"^" 1 ^ -p) n+i_1 } 

(6.8) 

= E (") 2-^(1 - V? n+2i - l {{n - i)(l - p) + -^} (6.9) 

i=0 ^ ' P 
n 

'n 



i=0 x 7 i=0 



(1 - P? n E ■ 2n_1 ( 1 - P ) 2i_1 {n(l - p) + - (1 - p))} (6.10) 

i=o P 

"(i - p) 2n E f 7 - ) 2 '" i ( 1 - pf l + (! - p) 2 " _2 ( 2 - (i - p) 2 ) E f 7 - ) 2 '" i ( 1 - ^ 

(6.11) 

- n(l - p) 2 "(2 + (1 - p) 2 )' 1 + (1 - p) 2 "- 2 (l - (1 - p) 2 )2"n (1 ~ p) (1 + )"- x 

(6.12) 

= n(l - p) 2n (2 + (1 - p) 2 )™ + 7i(2 - (1 - p) 2 )(2 + (1 - p) 2 ) 71 - 1 ^ - p) 2n (6.13) 
= 4n(l-p) 2 "(2 + (l-p) 2 )™- 1 (6.14) 

and finally, from (|6.5[) and ()6 . 14[) one gets: 

Var[X(n)} = (1 - p) n (3 - p)" - (1 - p) 2 "(2 + (1 - p) 2 )" + 4n(l - p) 2 ™(2 + (1 - p) 2 )^ 1 

(6.15) 

^-, 2 (2 i (1 -£ ) " + , 2 4 " (2 :: (1 '! )2) " (06) 



(3-P) 2 " " (3-p) 
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Now, since we have Var[X{n)\, we can use Chebyshev's inequality to estimate g n (p)- Cheby- 
shev's inequality states that: 

1 - P[P 3 " p contains an isolated point] = P[X(n) = 0] < P[|X(n) - fx\ > fi] < Var ^ n ^ 

(6.17) 

From RTTBI when p < 2 — ^/2, one gets Var[X{n)]/ ji 2 — > 0, as n — > oo. Therefore, from (|6.17p 
one gets lim„_>. 00 P[P 3 l p contains an isolated point] = 1. Finally, since 

PfP^p is connected] < P[P 3p does contains an isolated point] 
, then for p < 2 — \/2 the probability that P 3p is connected, as n — > oo, tends to 0. ■ 

For p > 2 - \/2: 

In order to calculate P[-P^ p contains an isolated point] when p > 2 — v2, as 71 — > 00, one 
can use the following inequality: 

P[P 3 " p contains an isolated point] = P[X(n) > 0] < E[X{n)\ = fj, (6.18) 

Since E[X(n)] — »• as n — >• 00, then limn^oo P[P 3 " p contains an isolated point] = 0. This 
means that the probability that there are no isolated points in P£ p for p > 2 — as n — » 00 , 
tends to 1. ■ 



^.^ Isolated components of size larger than 2 and smaller than 3 n 

Definition 31. C s is the family of subsets of V(P 3 rl ) with size s whose their induced graph is 
connected. 

Definition 32. The edge boundary ba(H), where H is an induced subgraph of G, is the number 
of edges which joins vertices in H to the vertices in G\H. Then bo(k) — min{bQ(H) : H C 
G, \H\ - k} 

Theorem 14. If p > 2 — \/2, the probability that for some S G C s , 2 < s < 3 44 ™. no edges of 
P 3 " p join 5 to V(P 3 ") \ 5, as n -t 00, tends to 0. 

Proof: 

Consider S C V = V(P^) and set b(S) — bp™ (H) where H is the induced subgraph of Pg" with 
the set of vertices S. One can observe that: 

P(No edges of P 3 " join S to V \ S) = (1 - p) b{s) (6.19) 

In order to prove the theorem, it is sufficient to show: 

go.44™ 

E E( 1 -f) 6(S) = °( 1 ) ( 6 - 2 °) 

«=2 sec 3 

From |16j one knows that for \S\ = s: 

b(S) > b(s) > \ s In - = ^-s(n - log 3 s) (6.21) 

3 s 6 
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one knows that: 

J2(l~P) HS) <\C s \(l-p) b(s) (6.22) 
sec B 

First, one should find a bound for |C S |. One has maximum 3™ choices to choose the first 
element for C s . The selected element is connected to maximum 2n vertices, therefore, there 
are maximum 2n choices to choose the second element. With the same logic, there are at most 
2n(s — 1) choices to choose the last element for \C S \. Hence, one can show: 

\C S \ < 3"(2n)(2n(2))...(2n(s - 1)) < (s - l)!(2n)' _1 3 n (6.23) 

and: 

|C S |(1 -p) b{s) < (s- l)!(2n) s " 1 3"(l -p)^ 1 ^™-^*) ( 6 .24) 

Set a := ^f-^- Assume that the right hand sides of inequality (|6.24[) is equal to A. After 
multiplying both sides of inequality (|6.24[) with 2 "^, + and then taking log 3 from both sides, 
one gets: 

log 3 (|a|(l-^>±^)<log 3 (A^^) (6.25) 

If \og 3 (A^^) -oo as n -> oo then should tend to for 2 < s < 3 44 ". This 

means that |C S |(1 - p) b(s)2s f^- tends to for 2 < s < 3 44 ™, as n -> oo. Therefore: 

s \ 

\C S \(1 - p) b{s) < - — — r for large values of n (6.26) 
2ns s+L 

which shows that: 

g0.44n 

s=2 sec s 

Finally, it remains to prove log 3 (A 2 "^| + ) — > — oo as n — > oo. One can verify this for s < n and 

s > n. After multiplying both sides of inequality (|6.24p with 2ns * and then getting log 3 from 
both sides, one gets: 

2n,s s+1 

logadC.Kl -p) b(s) gl ) < s(log 3 2n + log 3 s + an log 3 (l - p) - a(log 3 s) log 3 (l - p)) + n 

(6.28) 

For s < n the largest factor in equation (|6.28p is n(a(log 3 (l — p))s + 1) which is negative for 
p = p c and 2 < s < n. Therefore, log 3 (A 2 "^, + ) — > — oo as n — > oo for s < n. For s > n 
the largest factor in equation (|6.28l) is s(an(log 3 (l — p)) + (log 3 s)(l — alog 3 (l — p)) which is 

— alog3(l — p) s + 1 

negative if s < 3T=iT5lfti=iT" sa 3°- 44 », Therefore, log 3 (vl^i — ) -> -oo asn->oo for s > n.B 



For p > 0.67: 

Theorem 15. If p > 0.67, the probability that for some S G C s , 2 < s < 3", no edges of P 3 " p 
join 5 to ^(-Pf 1 ) \S, as n-> oo, tends to 0. Hence, the probability that P£ is connected tends 
to 1, as n — > oo. 
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Proof: 

We know that: 

\Cs\< [J <(— ) s (6.29) 

hence: 

(on\ p on 

J < (^-) s (l -p)^ s ("- lo S3 s ) (6.30) 

By using Mathematica one can show the right hand side of equation (|6.30p tends to zero as 



1 Mathcmatica is a computational software program developed by Wolfram Research of Champaign, Illinois 



41 



Chapter 7 



Reliable networks 



In many engineering applications, it is of interest to construct a graph (network), with a specific 
number of edges and vertices, which is the most reliable one in that family of graph with n 
vertices and m edges. One of the measures of reliability is all-terminal reliability. 

7.1 All terminal reliability 

Definition 33. Random subgraph of a graph, Percolation on a graph: Random sub- 
graph of G(V n ,E m ) is the graph G Pn which contains all vertices of G, and each edge of G 
independently with probability p n . Doing percolation on a graph with the parameter p n is the 
same as finding a random subgraph of a graph with the parameter p n . p n is called percolation 
parameter. 

Definition 34. Uniformly optimally reliable graph (UOR): The UOR graph, if it ex- 
ists, is the graph G n £ G(n,m) which maximizes the probability that G n is connected after 
percolation with the parameter p n for fixed n, m and all p n £ (0, 1). 

Definition 35. Reliability polynomial: Let Sk be the number of spanning connected sub- 
graphs of G n £ G(n,m) having exactly k edges. Let R(G ni p n ) be the probability that G n is 
connected after percolation with the parameter p n . One can formulate R(G n ,p n ) as follows: 



R(G n ,p n ) is called reliability polynomial of graph G n or all-terminal reliability. In this defini- 
tion, Si = for i < n — 1 and s m = 1. Also, s m _i is m— number of cuts in G n , and s n _i is the 
number of spanning tress of G n . 

The UOR graph, if it exists, is the graph G n £ G(n,m) which maximizes R(G n ,p n ) for all 
p n £ (0, 1). From the definition of reliability polynomial one can see that the value of reliability 
polynomial depends on the structure of a graph as well as percolation value. Trivially, for fixed 
values of p n there always exists an optimal solution for R(G,p n ). In other words, for fixed 
values of p n , there always exists a G n £ G(n,m) which maximizes R(G n ,p n ). The following 
theorem and corollary can be helpful to find the UOR graph. This theorem and corollary are 
extracted from |17j . 

Theorem 16. Let G and H be two undirected simple graphs both having n nodes and m edges 
and Sfc(G), Sk(H) denote the number of spanning connected subgraphs of G and H, respectively, 
with exactly k edges [T7j . 



m 




(7.1) 
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1. If there exists an integer < k < m — 1 such that Si(G) — Si(H) for i = 0, 1, fc and 
Sfc+i(G) > Sk+i(H), then there exists a p > such that for all < p < p we have 
R(G,p)> R(H,p). 

2. If there exists an integer < k < m such that Sj(G) = Sj(i?) for i = m, m — 1, m — k 
and s m _fc_i(G) > s m -k-i(H), then there exists ap < 1 such that for all p < p < 1 we 
have R(G,p) > R{H,p). 

Corollary 3. If G is UOR, then [17]: 

1. G has the maximum number of spanning trees among all simple graphs having n nodes 
and m edges, and 

2. G is max — A , i.e. has the maximum possible value of A among all simple graphs having 
n nodes and m edges, namely A(G) = [2m/n\, and the minimum number of cutsets of 
size A among all such max — A graphs. 

where A(G) is the edge connectivity of G, i.e., the minimum number of edges whose its removal 
will disconnect G. 

Important coefficients for large n: Let n be large and m sufficiently larger than n. When p n 
is close to then s„_i , the number of spanning trees of G„, has the most significant contribution 
in R{G ni p n ) since (1 — p n ) is almost 1 and p™ -1 is much larger than p" 1 . Similarly, when p n is 
close to 1, then s m _i, m— number of cuts, has the most significant contribution in R(G n ,p n ). 



Laplacian 

The Laplacian matrix of a graph is described briefly in chapter 1. Here, we present a few results 
on Laplacian. The author calculated the algebraic connectivity for all graphs with n = 5, 6, 7 
and n — 1 < m < (Z) and could not find a direct relation between algebraic connectivity and 
all-terminal reliability. There is a room for further work in this part. 

!deg{vi), if i = j; 
— 1, if i 7^ j and Vi adjacent to Vj] 

0, o.w. 

Arrange the eigenvalues of L as : Ai(L) < < ...X n (L). This set of Aj's are called the 

spectrum of L and A2 (L) is called the algebraic connectivity of a graph. The following interest- 
ing lemma sheds light on some applications of Laplacian matrix (this lemma is extracted from 
El): 

Lemma 5. For a graph G on n vertices, we have [53] 

(i) : 

^ A, < n 

i 

with equality holding if and only if G has no isolated vertices. 

(ii) : For n > 2 

n 

Ai < 



n- 1 

with equality holding if and only if G is the complete graph on n vertices. Also, for a graph G 
without isolated vertices, we have: 

n 

A n -i > 



71-1 
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n=5 


Sm— 1 


Sm-2 








s n -i 


m=5 


5 












m=6 


6 


12 










m=7 


7 


20 


24 








m=8 


8 


28 


52 


45 






m=9 


9 


36 


82 


111 


75 




m=10 


10 


45 


120 


205 


222 


125 



Table 7.1: The coefficients of the reliability polynomials of the UOR graph for \ 



and n — 1 < m < (™) . 



(iii:) For a graph which is not a complete graph, we have 

Ai < 1. 



(iv:) If G is connected, then Ai > 0. If A; = and Ai+i ^ 0, then G has exactly i + 1 connected 
components. 

(v): For alH < n — 1, we have: 

A,: < 2 



with 



A n -i — 2 



if and only of a connected component of G is bipartite and nontrivial. 

(vi:) The spectrum of a graph is the union of the spectrum of its connected components. 



Reliability for n = 5,6,7 

The author calculated R(G,p) for n = 5,6,7 and n — 1 < m < Q). The coefficients of the 
reliability polynomials are presented in tables I7TT1 17. 2l a.nd l7. 31 As t able 1 7 . ll illustrates . for n = 5 
there is always a UOR graph. For n — 6, 7 there are always a UOR graph except two cases. 
If, (n,m) = (6, 11), the optimal solution for R(G n ,p) depends on the value of p. For p < 0.29 
(approximately 0.29) graph 7.1(a) is optimal while for p > 0.29 graph 7.1(b) optimal. Also, 
if [n, m) = (7, 15), the optimal solution for R(G n ,p) depends on the value of p. For p < 0.81 



(approximately 0.81) graph 7.2(a) is optimal while for p > 0.81 graph 7.2(b) is optimal. From 
these observation one can conclude that the UOR does not exist for all values of n and m. 



Reliability for m = n — 1, n,n + 1, n + 2, n + 3 

For m = n — 1, n, n + l,n + 2 there always exists a UOR graph. For m = n — 1, any tree is 
the UOR graph. For m = n, C n , single cycle with n vertices, is the UOR graph. The first 
non-trivial case is m = n + 1, which is solved by F. Boesch [3T], [2D]. The UOR graph in this 
case is: for n > 5, start with a multigraph with 2 vertices and 3 edges. Then add total of n — 2 
vertices of degree 2 in each lines of the graph so that the number of vertices in each line differs 
by at most one [20]. For m = n + 2 the problem is also solved by F. Boesch. The UOR graph 
in this case is: start with K4, then add total of n — 2 vertices of degree 2 in each lines of the 
graph so that the number of vertices in each line differs by at most one [5D]. For m = n + 3, 
the UOR graph is found by G. Wang [55]. The UOR graph in this case is: start with a 
complete bipartite graph with 3 vertices in each part, and then add the remanning vertices as 
before. 
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n=6 


^771 — 1 


















°n — l 


— 


6 




















m=7 


7 


16 


















m=8 


8 


26 


36 
















m=9 


9 


36 


78 


81 














m=10 


10 


45 


116 


177 


135 












m a — ll 


11 


55 


163 


309 


368 


225 










m b — ll 


11 


55 


163 


310 


370 


224 










m=12 


12 


66 


220 


489 


744 


740 


384 








m=13 


13 


78 


286 


771 


1249 


1552 


1292 


576 






m=14 


14 


91 


364 


999 


1978 


2877 


3040 


2196 


864 




m=15 


15 


105 


455 


1365 


2997 


4945 


6165 


5700 


3660 


1296 



Table 7.2: The coefficients of the reliability polynomials of the UOR graph for n = 6 and n — 1 < m < (™). 
For m = 11, and p < 0.29 (approximately 0.29) the row with m a is the UOR graph and for p > 0.29 the row 
with is the UOR graph. 



m=9 

11=10 

a=ll 

0=12 

a=13 

0=14 

i a =J5 

x h =15 

n=ie 

q = 17 
a=18 



91 
105 
105 
120 
136 
153 



10 1 
159 
2 1 6 
284 
80 1 
455 
455 
560 

oso 

8 1 6 



117 
273 
456 
000 
99 1 
1360 
1360 
1817 
2379 
3060 



231 
612 
1146 
1932 
2946 
2946 
4328 
6169 
8. '.02 



182 
1248 
2668 
4704 
4705 
7766 
1226 
18485 

m 



720 
2460 
5464 
5465 
1054S 
18762 
31344 



1200 
4320 
4305 
10628 
22226 
41964 

X 



1840 
1805 
7396 
1980S 
44000 

ao 



2800 
12320 
35094 



4200 
19716 



Table 7.3: The coefficients of the reliability polynomials of the UOR graph for n = 7 and n — 1 < m < Q). 
For m = 15, and p < 0.81 (approximately 0.81) the row with m a is the UOR graph and for p > 0.81 the row 
with is the UOR graph. 




Figure 7.1: (n, m) = (6, 11), For p < 0.29 (approximately 0.29) graph (a) is optimal while for p > 0.29 graph 
(6) is optimal. 
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i 



i 



6 



5 



3 



5 



4 



2 



(a) 



(b) 



Figure 7.2: (n, m) = (7, 15), For p < 0.81 (approximately 0.81) graph (a) is optimal while for p > 0.81 graph 
(6) is optimal. 

Family of counterexamples 

Kelmans [TH] and Myrvold et al. [T!5] found infinite families of counter examples which the UOR 
graph does not exist. As an example, for n even and n > 6 and m = n(n — l)/2 — (n + 2)/2, 
or for 7i odd and n > 7 and m = n(n — l)/2 — (n + 5)/2 there always exists a graph in which it 
maximizes R(G,p n ) for p close to 1, but do not have the maximum number of spanning trees. 
Therefore, from theorem [TBI and corollary [3] the UOR graph does not exists for these families. 

7.2 Random accessibility 

M. Ebncshahrashoob, T. Gao and M. Sobel introduced the concept of random accessibility for 
simple graphs . They believe that finding the UOR graph is related to the concept of random 
accessibility. In random accessibility, the goal is to find the expectation and the variance of the 
number of transitions Xj needed to visit j new vertices in G € G(V n , E m ). In this approach the 
starting point is not considered as a new vertex. The result of the expectation and the variance 
can depend on starting point. Hence, one should change the starting point depending on the 
degree of it as a weighing factor. If one considers a graph with enough symmetry, the result 
does not depend on starting point. From analyzing numerical results, they make the following 
interesting conjuncture: 

• If the family of graphs contains both regular and non-regular graphs, then the UOR graph 
is among the regular graphs. Also, the expectation for random accessibility of graphs is 
equal or greater than the corresponding result of the UOR graph for each value of j close 
to m — 1 (with the same ordering of the graph as for all-terminal reliability). 
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